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METHODS FOR PREPARING SYNTHETIC REPETITIVE DNA 

The government has certain rights in this invention as a result of suppon 
provided by the Department of the Navy for the work leading to the present 
invention. 

5 rROSS.REFERF NrF TO RELATED APPLICATIONS 

This application is a continuation-in-part of Application Serial N o. 
08/175,155, filed December 29, 1993, which application is a continuation-in-part of 
Application Serial No. 08/053,049, which application through intervening 
applications, is a continuation-in-part of Application Serial No. 927,258. filed 

10 November 4, 1986, now abandoned. 

INTRODUCTION 

Technical Field 

The field is related to the production of high-molecular-weight polymers, 
15 either nucleic acids or peptides that are the expression products of the nucleic acids, 
and is particularly related to the production of high-molecular-weight peptides 
containing repeating sequences by biochemical processes, the peptides fmding use as 
structural materials. 

20 Background 

Recombinant DNA technology has been applied in the isolation of natural 
genes and the expression of these genes in a variety of host cells. Typically, this 
technology has had utility in producing biologically active polypeptides, such as 
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interferons or peptide hormones, which were impractical to produce in useful 
amounts by other means. It was also possible to produce modified proteins by 
isolating natural genes and utilizing the techniques of site specific, in vitro 
mutagenesis to alter these genes and thereby change the polypeptides produced. 

5 Other polypeptides have been created by combining sections of various native genes 
to produce new polypeptides that are chimeric molecules of the several naturally 
occurring molecules. 

With the advent of efficient and automated methods for the chemical 
synthesis of DNA, it has become possible to synthesize entire genes and to modify 

10 such synthetic genes at will during the course of synthesis. However, these various 
technologies have been applied to the production of natural or modified ve rsions of 
namral polypeptides. There have been very few attempts to use these technologies 
to create substantially new polypeptides. In namre, polypeptides have a \v'ide range 
of chemical, physical and physiological characteristics. Nevertheless, there are 

15 commercial applications for which known, naturally occurring polypeptid(ts are not 
appropriate. 

While biotechnology is versatile, usually it has been limited in its 
applications to naturally occurring products or modifications of naturally occurring 
molecules. One great strength of organic chemical synthesis, by contrast, has been 
20 the ability to transform inexpensive carbon materials to a wide variety of jpolymeric 
molecules, including nanirally occurring molecules, but most importantly entirely 
new chemical strucmres, such as polypropylene and polyacrylates, which have 
defined and predicted chemical properties not associated with namrally occurring 
molecules. 

25 Such materials, panicularly high-molecular-weight polymers conUiining 

repeating sequences of amino acids, have proven difficult to produce by Itiochemical 
means. The genes necessary for producing large peptides containing repeating units 
of amino acids were unstable and often underwent intermolecular recombination 
causing deletions of repeating units in the gene. The development of a 

30 biotechnology which would produce polymeric molecules by biological processes 
similar to those available by organic synthesis would significantly broaden the range 
of applications of biotechnology. 
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The cloning of multiple lactose operators up to four in tandem is disclosed by 
Sadler et al.. Gene (1980) 8:279-300. Hybrid bacterial plasmids containing highly 
repeated satellite DNA is disclosed by Brutlag et al.. Cell (1977) 10:509-519. The 

5 synthesis of a poly(aspartyl-phenylalanine) in bacteria is disclosed by Doel et al., 
Nucleic Acids Research (1980) 8:4575-4592. A method for enriching for proline 
content by cloning a plasmid which codes for the producuon of a proline polymer 
was disclosed by Kangas et al. , Applied and Environmental Microbiology (1982) 
43:629-635. The biological limitations on the length of highly repetitive DNA 

10 sequences that may be stably maintained within plasmid replicons is discussed by 
Gupta et al, in Bio/Technology, p. 602-609, September 1983. 

5;t]mmary of the invention 

Methods are provided for the production of protein polymers having 

15 extended stretches of small repeating units by expression of a synthetic gene. The 
amino acid repeating units are sequences that provide a motif for the protein 
polymer and comprise a major portion of the gene encoding the protein polymer. 
There may be more than one type of amino acid repeating unit in a single protein 
polymer. According to the design of the protein polymer, one or more different 

20 amino acid repeating units and, optionally, one or more amino acid intijmipting 
linker or spacer sequences are organized into a "monomer". In the finiil protein 
polymer, the amino acid monomer is sequentially replicated to achieve the desired 
molecular weight. 

To construct the gene encoding the protein polymer, a DNA monomer 

25 sequence encoding the amino acid monomer sequence is first designed and 
synthesized. There are three different approaches to synthesizing the DNA 
monomer: (1) synthesizing a plurality of dsDNA segments, which when ligated 
either in conjunction with their synthesis or after cloning and subsequent restriction 
enzyme digestion correspond to the desired DNA monomer sequence. Each dsDNA 

30 segment will typically encode a few amino acid repeating units, although the 
segment may encode an amino acid interrupting linker or spacer sequence. The 
dsDNA segments are synthesized by synthesizing single stranded oligomers which at 

3 
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least partially overlap and hybridizing pairs of oligomers to provide dsDNA. The 
dsDNA monomer is then assembled by independently cloning each dsDNA segment 
in a cloning vector and then sequentially inserting into a cloning vector containing 
all or a portion of a first segment all or a portion of each additional dsDI4A segment 
5 by restriction enzyme digestion and ligation, by simultaneously cloning in a cloning 
vector all of the dsDNA segments where each individual segment has a 3 ' or 5' 
terminus complementary to the 5' or 3' terminus of a second segment, aind so forth, 
or by combining convenient elements of these methods whereby a monomer is 
obtained with an open reading frame with the proper sequence, which monomer is 
10 sequenced; or (2) synthesizing a single strand of all or a portion of the DNA 

monomer and making the complementary strand, conveniently using the polymerase 
chain reaction (PCR) in which case the synthesized PGR template compiises the 
appropriate sequence of the ssDNA monomer flanked by appropriate primer binding 
site sequences. The resulting dsDNA segment(s) are then digested by restriction 
15 enzyme digestion to remove the primer binding sites, cloned in a cloning vector, and 
if more than one segment has been synthesized, assembled in a cloning vector as 
described above to form the monomer, which is sequenced; or (3) using the 
appropriate restriction enzymes, deleting a portion of a DNA monomer or 
combining all or pan of DNA monomers previously prepared as in (1) and/or (2) 
20 above, optionally including new dsDNA segments synthesized as above, and 
characterizing the monomer. Generally, the dsDNA segments will be sequenced 
after each cloning step. dsDNA segments prepared as in (1) and/or (2) above can be 
combined to form a monomer, which is sequenced. In those cases where new DNA 
is synthesized and introduced into a monomer, at least the new DNA and usually the 
25 entire monomer, will be sequenced. 

The monomers have predetermined termini for oligomerization The 
monomer is then concatenated or oligomerized under ligating conditions to form 
multimers of the monomer, where the multimers may have different numbers of 
monomers resulting in a plurality of genes having different numbers of monomers. 
30 At least one multimer is inserted into an expression vector for introducing the vector 
into an appropriate expression host for expression of the gene. The expression host 
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is then grown under conditions whereby the protein is expressed and may be 
isolated. 

RRTFF PF^rRiPTTQ I^ n? TH? PTfilJRES 
5 Fig. 1: Plasmid pSY701 strucmre. 

Fig. 2A-B: Immunoblots of polypeptide products using antibwly to (a) beta- 
lactamase or to (b) gly-ala-peptide. 

Fig. 3: Construction flowchart for plasmid pGlO/SlpI. 
Fig. 4A-B: Immunoblots of polypeptide products (a) T7gplO/;SlpI with anii- 
10 SlpAb. (b) T7gp9/Slpl with anii-Slp ab or (c) staining with Coomassie blue. 
Fig. 5: Construction flowchart for plasmid pSY856. 
Fig. 6: Time course for accumulation of the kanamycin-resiswnce gene 
product with the T7 system. 

Fig. 7: Construction flowchart for plasmid pSY857. 
15 Fig. 8: Construction flowchart for plasmid pSY980. 

HG. 9A-B: (A) Amido black stain or gel containing the product of beta- 
galactosidase/SlpIIl gene fusion; (b) immunoblot of same product with anti-Sip 
antibody. 

Fig. 10: Construction flowchart for plasmid pSY1280. 

20 

Pl^f^ ^piiynnN of sPEn ^<" FMRoniMENTS 
Novel polypeptides are provided which are block polymers of repeating. 

relatively short, amino acid sequence units. The blocks of repeating units 

(oligomers) may be linked by spacers of different amino acid sequences. The 
25 polypeptides may contain only one or a plurality of repetitive amino acid sequences 

(having the same or different amino acid sequences). The novel polypeptides are 

particularly useful as fibrous or strucmral proteins, including crystalline. 

elastomeric. tough and bony materials, e.g. proteins similar to . but different from. 

silk, elastin. collagen, keratin or other naturally occurring structural polymers 
30 having a repetitive amino acid sequence motif. The gene encoding the repeaung- 

unit-containing peptides is produced to particularly avoid problems previously 

associated with genes conuining multiple repeating units. 
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Genes produced according to the methods described herein will generally be 
at least 900 nt in length, usually at least 1200 nt in length, preferably at lest 1500 nt 
in length, usually not more than 20 knt in length, more usually not mor«! than 12 km 
in length, frequently not more than about 6 knt in length. This will usually result in 
5 a protein of from about 30 kDal. usually at least 35 kDal, and not more than about 
250 kDal, more usually not more than about 125 kDal. The methods fcir production 
of the synthetic genes encoding the protein polymer involve preparation of a dsDNA 
"monomer", which is an extended segment of DNA principally encoding amino acid 
repeating units, where the dsDNA monomer is generally a repeating segment of the 
10 fmal product, where the final product will have from 2. frequently at least 3, and up 
to 50, usually not more than about 30, more usually not more than about 20. 
monomeric units. There is one exception, to be described below, where the 
monomer may be the entire final repeating unit gene. The monomer will be a 
dsDNA whose sequence is. with one exception, established prior to its 
15 multimerization to provide the gene. 

The size of the dsDNA monomer is dependent upon the desired amino acid 
monomer sequence as well as the way in which the monomer is obtain<:d. If the 
monomer is constructed using any newly synthesized and ligated DNA, then the 
monomer is always sequenced prior to multimerization and the practical limitations 
20 of DNA sequencing technology limit the monomer size to about 500 m:. usually 
about 400 nt. If the gene monomer is constructed solely from digestion fragments 
of previously constructed and sequenced monomers, then the final gene monomer is 
typically characterized by restriction digests. Therefore, the gene monomer can be 
as large as the final gene, depending upon the desired amino acid repeating unit 
25 sequences and periodicity. 

Because of the nawre of the subject genes, which involve long tracts of 
repetitive units encoding the same amino acid sequence, the manner in which the 
monomer gene may be successfully prepared is restricted and requires that there be 
certainty as to the fidelity of the sequence. In the initial methods of preparation, 
30 there were a number of stages: preparation of ssDNA pairs, which overlapped and 
when hybridized provided segments; cloning of segmems; sequencing, of segments to 
ensure the fidelity of the sequence; and combining the segments to form the 
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monomer. The order and manner in which these various operations were carried out 
could be varied, depending on the nanire of the gene and the protein i t encoded. In 
order to reduce the possibility of undesirable recombination events, different nucleic 
acid sequences are used to encode the same repetitive unit, 
5 There are three ways to obtain the monomer. The first way relies on 

synthesis and assembly of single stranded deoxynucleotide oligomers encoding from 
about 1 to 12, more usually 1 to 9, frequently 1 to 6 repeating amino acid units into 
a dsDNA monomer sequence. Each repeat unit will have about 3 to 30 codons (9 to 
90 bases), usually about 3 to 25 codons, more usually about 3 to 15 codons, 
10 frequently not more than 9 codons, particulariy when mimicking a namrally 
occurring motif. The number of amino acid repeat units in a dsDNA monomer 
sequence will depend to a substantial degree on the size of the repeating unit. 
Conveniently, ssDNA oligomers may be prepared having from about 15 to 120 
bases, usually about 21 to 90 bases, more usually about 39 to 72 bascts, although 
15 oligomers may be prepared with up to 300 bases, more usually up to about 252 

bases. For repeating units having a few amino acids, usually in the range of 3 to 15 
amino acids, more usually in the range of 3 to 9 amino acids, the single stranded 
oligomer will conveniently have from about 2 to 12 amino acid repealing units. 

The number of different single stranded oligomers will usually be at least 2, 
20 forming 1 pair, more usually about 6, forming 3 pairs, or may be 8 or more, 
forming 4 or more pairs, where the protein polymer has the same repeating unit. 
Where block copolymers are prepared, the number of oligomers will depend on the 
number of different blocks and the size of the blocks. The dsDNA segments formed 
by the pairs of oligomers of the different ssDNA oligomers may encode the same 
25 amino acid sequence or a different amino acid sequence, but where more than one 
dsDNA segment is synthesized, at least two segments will have different nucleotide 
sequences. Each pair of oligomers forming a dsDNA segment are complementary 
and at least partially overiap, providing blunt or cohesive (protruding) ends, 
preferably protruding ends, to allow for ease of assembly and ligation of the dsDNA 
30 to form a "monomer". 

The dsDNA segments are desirably assembled in a prokaryotic vector by 
linearizing a vector having an origin and convenient restriction sites, which may 

7 
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involve a polylinker, for insenion of one or more dsDNA segments. The vector will 
also have a marker gene for selection, which will usually impart antibiotic 
resistance, but may afford another distinguishing characteristic, e.g. chromophore or 
fluorophore formation. The marker wiU preferably provide antibiotic resistance. 

5 there being a wide variety of antibiotic reagents, e.g. tetracycline, chloramphenicol, 
actinomycin. neomycin, ampicillin, hygromycin. heavy metals, etc. Other markers 
include B-galactosidase, which, with the substrate X-gai, provides a blue color. 
Numerous vectors are commercially available for cloning in E. coli and need not be 
exemplified here. The vector is then introduced into an appropriate cloning host by 
10 any convenient means, including calcium phosphate precipitated DNA, fusion, 
transfection. conjugation or the like. The cells are then grown in an appropriate 
selective nutrient medium. Surviving cells are harvested, lysed and the ]3lasmid 
isolated. 

By having a multiplicity of dsDNA segments, the termini may be: designed 
15 that the first segment has a 3' or 5' terminus complementary to the 5 ' or 3' terminus 
of a second segmem and so on, where the termini may have differem consensus 
sequences for different restriction enzymes or not be recognized by any known 
restriction enzyme. The termini of the dsDNA segments may be selected to have 
protruding 5' ends, protruding 3' ends, or a protruding 5' and a protruding 3' end 
20 on the same strand, either the coding strand or the non-coding strand. 

Complementation of the protruding ends may destroy the sequence of the restriction 
site or retain the sequence, when different dsDNA segments are ligated 

Restriction enzymes are used to digest the dsDNA of the cloning vector and 
insert dsDNA segments. Restriction enzyme digestion of the vector, whether or not 
25 already having inserted dsDNA segments, will provide termini which are 

complementary to the termini of the next dsDNA segment being inseruid, which 
may already be a combination of two or more synthesized dsDNA segments. In 
selecting dsDNA segment sequences, one generally selects the terminal sequences to 
allow for linearization of the vector and insertion of the next dsDNA segment, 
30 proximal to a terminus of a segment. However, sometimes it is convenient in 

creating a monomer to insert a dsDNA segment within a previously cloned segmem. 
The final dsDNA segment sequence which is inserted into the cloning vector may 
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not be the entire dsDNA segment which was synthesized, but will have the 
appropriate complementary lerniini that allow for insertion formed by the restriction 
enzyme digestion. Similarly, digestion of the vector may create the appropriate 
termini for insenion of the next dsDNA segment by restriction enzyme digestion that 
5 deletes a portion of the originally synthesized and cloned DNA. Geneirally, in 
building a monomer it is preferable to cleave the vector with a single rssiriction 
enzyme corresponding to a single restriction enzyme recognition site so that the use 
of partial enzyme digestions can be avoided. 

By having different termini at each end of each dsDNA segment, the 
10 individual segments cannot oligomerize, even if they have been phosphorylated. In 
this way, when the different segments are combined, the ends of the combination of 
the segments may have complementary termini, so that they can be oligomerized. 
The 3' and 5' termini of each dsDNA segment are generally selected so that only 
one copy of the segment can be cloned in one orientation into a cloning vector. 
15 However, sometimes it is convenient to have complementary 3' and 5' termini, so 
that a segment can be cloned into an existing monomer or portion of a monomer 
even though it is then necessary to select a clone having the segment inserted in the 
correct orientation and number. During the construction of the monomer, some 
combinations of dsDNA segments may not be in reading frame, one to the other. 
20 However, by appropriate selection of the dsDNA segment sequences and the 
restriction enzymes used in the monomer construction, the final combiination of 
dsDNA segments comprising the monomer will be in a continuous op<!n reading 
frame coding for the desired amino acid sequence. The above-described approach 
may be used with the other methods of forming the monomer, as will be described 
25 hereafter. 

Using these techniques and design strategies, it is possible to construct the 
dsDNA monomer in a variety of ways, as described in the examples ihzt follow. In 
one variation, a first dsDNA segment is cloned into the cloning vector after it has 
been linearized by restriction enzyme digestion. After cloning, the fiirst dsDNA 
30 segment is characterized, such as by restriction analysis and sequencing. Where the 
dsDNA segment is relatively small, sequencing can be performed rapidly and 
substantially error free. 

9 
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Once the first dsDNA segment has been shown to have the correct sequence, 
the vector may then be used in the next stage in the preparation of the gem. The 
vector is linearized at the 5' or 3' terminus of the first dsDNA segment cloned. By 
employing a polylinker in the vector at the 5' and/or 3' terminus of the dsDNA 
5 segment cloned, the vector may be digested by using a restriction enzyme which 
cleaves in the polylinker to provide a terminus at the 5' or 3' terminus of the vector 
complementary to the 3' or 5' terminus of the next dsDNA segment. Alternatively, 
one may use restriction enzymes which cleave distal from the consensus ajquence. 
In this way the vector may be repeatedly cleaved and ligated, without cleaivage of 
10 the monomer DNA being constructed. After cloning, the combined dsDMA 

segments may be characterized as described above. The process may be repeated 
until all of the dsDNA segments have been inserted and verified for sequc:nce and 
being in the proper order and reading frame. 

In another variation, two or more dsDNA segments may be sequentially 
15 cloned as described above, with each new insertion at the 3' or 5' terminus of the 
previously cloned segment, and then another dsDNA segment may be insened 
between the previously cloned segments. In another variation, a first dsE^NA 
segment may be cloned as described above and then another dsDNA segment may be 
insened internal to the cloned segment. In another variation, two or more dsDNA 
20 segmenu may be cloned simultaneously into a vector, with additional dsDNA 
segments sequentially insened a) at the 3' or 5' termini of the previously cloned 
DNA segments, b) between the previously cloned DNA segments, or c) internal to a 
previously cloned segment. In another variation, all dsDNA segments comprising 
the monomer may be simuluneously ligated into the cloning vector. In another 
25 variation, each dsDNA segment comprising the monomer may be individlually 
cloned and characterized The individual dsDNA segments are then purified and 
ligated in a single cloning step to construct the monomer, which is sequenced. 

An essential element in this method for constructing the monomer is that the 
pairs of ssDNA be annealed into dsDNA segments prior to further manipulation. 
30 Generally, each dsDNA or combination of segments, once cloned, is sequenced 
prior to further manipulation. The monomer is always sequenced prior lo 
multimerization. 

10 
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A second approach depends on the synthesis of a single strand of all or a 
portion of the monomer. Synthetic techniques allow reasonably accurate 
oligonucleotide synthesis of 300 bases or more. For the most pan the single strand 
will be in the range of about 100 to 300 bases, usually in the range of ai[x)ut 100 to 

5 250 bases. The single strand is then used to produce a complementary j;trand, 
conveniently using the polymerase chain reaction ("PCR"). The synthesized PCR 
template comprises the appropriate sequence of the ssDNA monomer flanked by 
appropriate primer binding site sequences. 

The primers which are used for PCR are designed not to hybridize readily to the 

10 repeat sequences, having overall a nucleotide sequence substantially different from the 
nucleotide sequence of the sequence being amplified, although a portion of the primer 
desirably includes a sequence common v«th the non-repetitive portion of the sequence 
being amplified, this will be not more than about 30 nucleotides, usually nox more than 
about 25 nucleotides and at least 10 nucleotides, usually at least 12 nucleotides. The 

15 total number of nucleotides in the primers will generally be in the range of about 1 5 to 
50, more usually 20 to 45. In the portion of the primer hybridizing with the target, 
none or a portion of a restriction enzyme consensus sequence is included in the primer 
which binds to the target ssDNA, where only a portion is present, the target will 
complete the consensus sequence. Restriction enzyme digestion allows f cleavage at 

20 the terminus or proximal to the terminus of the primer sequence hybridizing to the 

target ssDN A. For the most part, where a portion of the restriction enzyme consensus 
sequence is included in the primer, restriction enzyme digestion will maintain the 
monomer amino acid motif Preferably, not more than about 5 nt, usually not more 
than about 3 nt, preferably not more than 2 nt, will be left fi-om the primer after clevage 

25 by restriction enzyme digestion. Usually, the consensus sequence will be 6 to 8, usually 
6 nucleotides for the restriction enzyme. The 3' and 5' primers will be different, so 
as not to hybridize with each other. In order for the primers to work efficiently, the 
primers must have similar T„ and AG characteristics for hybridizing v^th their 
respective complementary DNA sequences. These characteristics can be controlled by 

30 the length of the primers, the length of the sequence hybridizing with the target ssDNA, 
and the proportion of GC binding that occurs. Generally the primers will have at least 
40%, more usually at least 45% G and C, and may be 50% or more, usually, not 
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The third approach relies on the use of ftilly characterized dsDN A which is 
already present in a monomer, previously prepared by either of the methods 
described above. Using this approach allows for great flexibility in constructing 
new monomers, particularly where copolymers comprising different amino acid 

5 repeating units are desired. Using the appropriate restriction enzymes, all or part of 
the dsDNA comprising a monomer may be purified. In some instances;, all of a 
ponion of a previously synthesized monomer is then used as a new momomer in 
combination with newly synthesized dsDNA segments prepared as in (:i) and/or (2) 
above or a portion of a previously synthesized monomer is then used xi a new 

10 monomer by itself. In other instances, the desired dsDNA from two or more 
separate monomers may be combined to construct a new monomer encoding the 
amino acid repeating units of interest, either by themselves or in combination with 
newly synthesized dsDNA segments prepared as in (1) and/or (2) above. The 
digested monomer DNA fragments which are to be combined may have 

15 complementary or non-complementary ends. If the termini of the monomer 
sequences are not complementary, as required, the termini may be made so by 
employing adapters, filling in, nuclease digestion, or the like. Once the appropriate 
monomer sequences have been cloned together, sequentially and/or simultaneously, 
to make the new monomer, the monomer is then characterized and sequenced, if 

20 necessary. If newly synthesized adapters or filling in reactions or nuclease digestion 
or the like are employed, the region comprising the modified monomer DNA is 
sequenced. 

When the gene encoding the desired protein product is a homooligomer of 
the monomer, desirably the termini have cohesive ends and may retain the same 

25 restriction site consensus sequence or result in a sequence other than tlie consensus 
sequence. By appropriate choice of the restriction enzymes or polylinker, the 
termini of the monomer may have the same or different terminal restriction sites, but 
will have complementary ends, if the monomer is to be multimerized. Preferably, a 
single restriction enzyme that cleaves the monomer from the vector at asymmetric 

30 consensus sites will be employed. However, restriction enzymes that cleave outside 
of the recognition sites may also be used. By having a monomer with, different, yet 
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complementary termini, the monomer may be ligated in vitro with the monomers 
only assembling in one orientation. 

As evidenced by the above description, the monomer is a molecule which 
will normally be comprised of a plurality of previously prepared dsDNA segments, 

5 normally being formed from at least two different dsDNA segments, which may or 
may not encode the same amino acid sequence, but generally providing for blocks of 
the same pattern of repeat amino acid units throughout the final polymer g:ene (the 
exception is when the monomer is the gene). Thus the monomer may provide for a 
homopolymer, copolymer, or polymer having a defined motif, where the amino acid 

10 repeating units vary, e.g. collagen. 

The monomer is then multimerized by ligation, conveniently employing from 
about 0.01 to 100 Mg of the monomer under ligating conditions, where multimers 
having different numbers of monomers are obtained. The multimers may then be 
segregated by size, selecting multimers of a predetermined size. Any of the original 

15 mixmre, the partially purified mixmre, or size segregated fractions thereof, may 
then be introduced into a vector. Either an adapter vector or an appropriate 
expression vector is employed. The adapter vector has a poly linker which will allow 
for insertion into the polylinker, so as to be capable of being read in any i eading 
frame. In this way one may introduce different unique restriction sites which allow 

20 for excision and transfer of the multimer gene from the expression vector. The 
multimer gene may be characterized and purified before transfer to the expression 
vector. The multimer will have appropriate termini which will allow for insenion 
into the vector and, as appropriate, have end groups which are present in the vector 
or be inserted with termini which will allow for the exact excision of the gene. One 

25 may select a particular sized multimer or a plurality of multimers of different size 
for expression, so that one has a family of protein polymers, sharing the !;ame 
repeating motif. 

The expression vector will be characterized by having an origin of replication 
which is functional in an appropriate expression host, usually for episomal 
30 maintenance, and a marker for selection. Markers as described above may find use. 
For unintegraied vectors or constructs, the origin of replication will usually provide 
for multicopies, usually greater than about 5 copies on the average. 

14 
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The expression vector will also have a promoter which is funciianal in the 
expression host. Various promoters can find use, which provide for a liigh level of 
transcription, either inducible or constitutive transcription. Illustrative promoters 
include p-lactamase, p-galactosidase, k?^ or XP^ promoters, trpE promoter, trp-lac 
5 promoter, T7 promoter (particulariy genes 9 and 10), cP. etc. The multimer gene 
and the linearized vector may be combined under hybridizing, usually including 
ligating, conditions. Where the multimer gene does not have an initiation codon, 
such a codon can be added. More convenienUy, the multimer gene mi\y be inserted 
into a coding sequence present in the vector, under the transcriptional conttol of a 
10 promoter. The coding sequence in the vector will generally not exceed 200 bp, 
usually not exceeding about 60 bp. where the site into which the multimer gene is 
inserted has the coding sequence and multimer gene in proper reading frame. 
Generally, the coding sequence present in the vector will be not more than 20%, 
usually fewer than 10%, preferably fewer than about 5% of the total rumber of 
15 bases in the coding sequence. 

A signal sequence may be present at the 5' terminus of the coding sequence 
to allow for secretion of the protein polymer into the periplasmic space. For the 
most part, the product will be produced intracellularly. 

Instead of a vector, DNA constructs may be employed for transformation of 
20 the expression host, with integration of the construct into the genome of the 

expression host. The construct will differ from the vector primarily by lacking an 
origin which provides for episomal maintenance. Thus, the construct will provide at 
least transcriptional and translational initiation and termination regions, the gene 
encoding the protein polymer between the initiation and termination regions and 
25 under their regulatory control, a marker for selection as described above, and other 
functional sequences, such as homologous sequences for integration into the host 
genome, sequences for priming for the polymerase chain reaction, restriction sites, 
and the like. 

For the most part, the expression host will normally be unicellular, 
30 prokaryotic or eukaryotic, but may be from a multicellular organism. The organism 
may be selected from bacteria, algae, fungi, insect cells, plant cells, etc. Illustrative 
hosts include £. colL B. subtilis, B. stearothermophilus, S. cerevisiae., and the like. 

15 
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The expression host is then grown in accordance with conventional ways in 
an appropriate medium in culture, e.g. fermentation. After the cells have been 
grown to an appropriate density, the cells may be harvested, lysed and iJie product 
isolated by appropriate means, in accordance with the physical and chemical 
5 characteristics of the product. In some instances, the product is insolutile at 
moderate temperatures in an aqueous medium, and may be purified by detergent 
extraction at mildly elevated temperatures, above about 60'C, see U.S. Patent No. 
5,235,041 . As appropriate, the crude or purified product may then be used for its 
intended purpose. 

10 The genes of the subject invention generally comprise concatemited 

monomers of DNA encoding the same amino acid sequence, where all or a part of 
two or more different monomers encoding different amino acid repeating units may 
be joined together to form a new monomer encoding a block copolymer. The 
individual amino acid units will have from 3 to 30 amino acids (9 to 90 nt), usually 

15 3 to 25 amino acids (9 to 75 nl), more usually 3 to 15 (9 to 45 nt) amino acids, 
usually having the same amino acid appear at least twice in the same unit, generally 
separated by at least one amino acid. In some instances, the minimum number of 
amino acids will be 4. Within a monomer, dsDNA encoding the same amino acid 
repeating unit may involve two or more nucleotide sequences, relying on the codon 

20 redundancy to achieve the same amino acid sequence. 

For the most pan the DNA compositions of this invention may be depicted 
by the following formula: 

K, (W M, X, N. Y,), L, 

wherein: 

25 K is a DNA sequence encoding an amino acid sequence of from about 1 to 

125 amino acids, usually .1 to 60 amino acids, which may be any sequjnce 
depending upon the manner of preparation of the construct and the puipose of the 
protein product, generally being fewer than about 20% of the total number of amino 
acids, more generally being fewer than about 10% of the total numbei of amino 

30 acids, which may be any sequence, particularly a nanirally occurring sequence 
where the multimer structural gene has been fused to another DNA sequence in 
reading frame. K, if present, will have the initiation methionine codon. L may be 
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the same or different from K, coming within the definition of K, but Hacking the 
initiation methionine codon. ; 

k and 1 are the same or different and are 0 or 1 ; 

W has the formula: 
5 [(A)„(B)p], 
wherein: 

A is a DNA sequence coding each time that it appears for the same amino 
acid repeating unit normally having at least one amino acid appear at least twice in 
the sequence, where A will generally be from about 9 to 90 nucleotides (nt), more 
10 usually from about 9 or 12 to 75 nt, preferably from about 9 or 12 to 45 nt, more 
preferably from about 9 or 12 to 30 nt. and in some instances may be: as few as 24 
nt: 

where there will usually be at least two different A's, usually not more than 
ten different A's. more usually not more than six different A*s, whiciii code for the 

15 same amino acid sequence but differ from each other by at least one nucleotide and 
may differ by as many as ten nucleotides, usually not differing by more than about 
five nucleotides from another A sequence, each of the different A's iisually being 
repeated at least twice; at least two different codons are employed for the same 
amino acid, e.g. GGC and GGA for glycine, in different A's coding for the same 

20 amino acid sequence unit; 

n will be an integer of at least 2, usually at least about 4, moi e usually at 
least about 8, and not more than about 250, usually not more than atiout 200, 
frequently not more than about 125, and in some instances may not cixceed about 50; 
B is a DNA sequence different from A coding for an amino acid sequence 

25 other than the amino acid sequence unit coded by the A unit and senses as a linking 
unit between oligomers of A units. B will generally have from about 3 to 45 nt, (1 
to 15 amino acids) more usually from about 3 to 30 nt (1 to 10 amina acids); 

where the B units appearing in the gene may be the same or different, there 
usually not being more than about 10 different B units, more usually not more than 

30 about 5 different B units, where the B units may differ from about 1 to 45 nt, more 
usually from about 1 to 15 nt, where the different B's may code for the same or 
different amino acid sequence; 
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p is 0 or I and may differ each time there is a successive A unit; 
q is an integer of at least 1 and will vary with the number of nucleotides in A 
and as well as the values of n and p. The variable q will be selected so as to 
provide for at least 90 nt for the multimeric portion of the structural gene, preferably 
5 at least about 150 nt, more preferably at least 450 nt, and most preferably at least 
900 nt, and the number of nucleotides will usually not exceed about 10,aX) nt, more 
usually not exceeding about 8,000 nt, generally being in the range of about 900 to 
6,000 nt, more usually to about 5,000 nt; and 

M is a DNA nucleotide sequence of about 12 to 150 nt, usually b<;ing 18 to 
10 150 nt, more usually not more than about 90 nt, which may encode any jtmino acid 
sequence, usually encoding a functional sequence which provides for a nawral or 
synthetic sequence resulting in a biological or chemical function or activity; 

r and s are the same or different, being 0 to 3, usually 0 to 2, depending on 
whether a functional group is present in the polymer, usually being 1 to where 
15 different, the same or similar functional groups may be combined in a ccntiguous 
manner; 

N is the same or different from M and comes within the same defmition as 

M; 

X may be the same as or different from W, usually different, and will have 
20 the formula: 

[A')ni (B*)p>lql 

wherein: 

A*, B^ N^ p^ and q* are the same as or different from A, B, n, p and q 
respectively, at least one being different, wherein the analogous symbohi come 
25 within the same definition as their counterparts; 
X is 0 or 1 ; 

Y may be the same as or different from W. usually different, and will have 
the formula: 

lA')n2 (B')p2]q2 

30 wherein: 
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A^, B^, n\ and are the same as or different from A, n„ p and q 
respectively, at least one being different, wherein the analogous symbols come 
within the same definitions as their counterparts. 

y isOor 1; 

5 i is 1 to 100. usually 1 to 50, more usually 1 to 30. panicularly 1 , when x, 

y, r and s are 0; 

when X or y are 1, q, and q^ will be a total of at least 2, usually at least 5 
and not more than about 50. usually not more than about 33, 

The toul number of nucleotides will be at least 900 nucleotides, usually at 
10 least about 1200 nt, preferably at least about 1500 nt and may be 20 lait 

(kilonucleotides), usually not more than about 6 knt. more usually not more than 
about 4 knt. 

The polypeptide encoded by the above DNA sequence will have the 

following formula: 
15 K\(W'M',X\N',YViL', 

wherein: 

W will have the following formula 

[(D)„ 

wherein: 

20 D is the amino acid sequence encoded for by A and therefore has the 

numerical limitations based on 3 nucleotides defining a codon that cedes for one 
amino acid; 

E is the amino acid sequence encoded for by B. and therefore; has the 
numerical limitations based on 3 nucleotides defining a codon. when; each E may be 
25 the same or different, depending upon the coding of B; 

and, wherein, likewise K\ W\ M\ X\ N', and L' is the amino acid 
sequence encoded for by K, W. M, X, N. Y and L respectively. However, in the 
case of K and L, subsequent processing, such as protease treatment, cyanogen 
bromide treatment, etc. may result in partial or complete removal ol' the N- or C- 
30 terminal non-multimeric chains. 

n, p, q, k, r. s, x, i and 1 have the same definitions as previously indicated. 
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Particular polymeric compositions having amino acid repeating units having 
the same compositions (A) will have the following formula where x and y are 0, 

where all of the symbols have been defined previously; and 
5 the DNA sequence will have the formula 

KJ(A),(B)pm 
where all of the symbols have been defined previously. 
Particular DNA sequences encoding copolymeric compositions having a 
repeating unit of two to three multimeric blocks will have the following formula: 

10 K\(W"M\X%N\YVrL"i 
wherein: 

W" is a multimer having the formula 

where AMs of 3 to 15, usually 4 to 6 codons, otherwise coming v;^ithin the 
15 definition of A; 

n^ will be from about 2 to 40. usually 2 to 32; 

is of from 2 to 20, usually 4 to 6 codons; 
pMs 0 or 1 ; 

is of from about 2 to 50, usually 2 to 30, depending on the value of n\ as 
20 discussed previously for n and q; 

X" and Y" are the same as or different from W\ usually differed, coming 
within the same definitions as W"; 

M" and N" come within the definitions of M' and N*; 
i" is at least 2, usually at least 5 and not more than about 75, usually not 
25 more than about 50, generally not exceeding 30; 

with the other symbols as defined previously, wherein at least ont of x and y 

is 1. 

The compositions of the invention will usually have a molecular weight of at 
least about 30 kDal, usually 50 kDal, frequently at least about 60 kDal md may 
30 have molecular weights as high or higher than 500 kDal, usually not exc:eeding 300 
kDal, more usually not exceeding about 250 kDal, and in many instances not 
exceeding 125 kDal, the higher ranges generally being the multimer combinations, 
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with the individual muitimer usually being less than about 150 kDal, usually less 
than about 100 kDal. 

The nucleotide sequences which are employed will be synthesized, so that the 
repetitive units will have different codons for the same amino acid as described 

5 above. Usually, at least about 25%, more usually at least about 40%, and generally 
at least about 60%, but not greater than about 95%, preferably not gniater than 
about 90% of the nucleotide sequences encoding the repetitive units w ill be the 
same. Greater diversity within those ranges will be employed where ilie initial 
constructs are experimentally shown to undergo spontaneous recombination events. 

10 Of particular interest are polypeptides which have as a repeating unit 

SGAGAG (G = glycine; A = alanine; S = serine), where the choice of the S as the 
initial amino acid of the repealing unit is arbitrary, since except for the first and last 
units, all the other units will be the same. This repeating unit is found in a naturally 
occurring silk fibroin protein, which can be represented as 

15 GAGAG{SGAGAG)8SGAAGY(Y = tyrosine). In the subject invention, the 
repeating unit is designed where the N-terminus may be MGAGAG or any other 
sequence of generally at least about 3 amino acids, usually at least about 5 amino 
acids, more usually 12 amino acids and not greater than 235, usually not greater 
than 100 amino acids, which may be different from the repetitive unii:. Generally, a 

20 different N-terminus will be the result of insertion of the gene into a vector in a 
manner that results in expression of a fusion protein. Any protein which does not 
interfere with the desired properties of the product may provide the M-ierminus. 
Panicularly. endogenous host proteins, e.g. bacterial proteins, may be employed. 
The choice of protein may depend on the nature of the transcriptional initiation 

25 region. Similarly, the C-terminus may have an amino acid sequence different from 
the repeat sequence. Conveniently, there may be from 1 to 125, frequently 1 to 
100, usually 1 to 25 amino acids, which may be the C-terminus of a naturally 
occurring structural gene, which again typically results from the formation of a 
fusion product. 

30 A silk-like-protein (Sip) gene may be produced by providing oligomers of 

from about 5 to 25 repeal units as described above, more usually of about 10 to 20 
repeal units. By having different cohesive ends, the oligomers may be 
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concatemerizcd to provide for the polymer having 2 or more of the oligomeric units, 
usually not more than about 50 oligomeric units, more usually not more than about 
30 oligomeric units* and frequently not more than about 25 oligomeric units. 

The silk-like proteins may be varied by having alternate multimei-s with the 
5 same or different handedness. For example, in the formula, {B\ may pi-ovide an 
even or odd number of amino acids. In silk, the hydrogens of the glycine may align 
on one side and the methyls and hydroxyls of alanine and serine on the cither. If 
(B)p is even, there will be continuous alignment, if odd, there will be alternating 
alignment of (A)„. Thus, different properties can be achieved by changing the 
10 number of amino acids encoded by (B)p. 

Of particular interest are polypeptides which mimic the composition and 
physical propenies of silks found in nature, e.g. Bombyx mori. 

Also of interest are polypeptides which have as a base repeating mnit GVGVP 
(G = glycine, V = valine, P - proline), which may be found in naturally occurring 
15 elastin; also VPGVG and/or APGVGV units, where again the choice of the initial 
amino acid in the repeating unit is arbitrary. In the subject invention, the N- 
lerminus may be any convenient sequence and, if desired, may be in whiole or in 
pan removed by a protease. Usually the N-terminal sequence which does not have 
the subject motif will be less than about 125. frequently less than about 100 amino 
20 acids, more usually less than about 60 amino acids. 

Of particular interest is a base sequence of about 2 to 32, preferably 8. units 
separated by a sequence of about 3 to 50 amino acids, usually 12 to 48 amino acids, 
which may include an internal repeat of from 3 to 15 amino acids diffeient from the 
basic repeating unit. For example, the second repeat sequence could be: GAGAGS, 
25 repeated twice. The total number of base repeating units will generally be in the 
range of about 150 to 500, more usually 150 to 300. and more usually 175 to 250. 
The C-terminus may terminate with a repetitive unit or portion thereof or a different 
sequence of from 1 to 125, usually 1 to 50 amino acids. The C-termimiis is not 
critical to the invention and will be selected primarily for convenience. As with the 
30 N-terminus. it may be designed for proteolytic cleavage. As in the case of the silk 
protein, the subject elastin-like protein may be similarly engineered. 
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Of particular interest are proteins which mimic the properties of elastin and 
provide for elastomeric properties and the use of elastin blocks to impa rt different 
physical properties, e.g. elastic properties or to modify the solubility properties of a 
different repetitive unit polymer. 
5 Of particular interest are collagen like proteins which have the :5equence 

Gap. where a and P may be any amino acid, particularly one being pioline. 
Usually in the protein a and p will be selected so that the total percent proline in the 
protein is between about 10 to 45 number % of the amino acids in the protein. The 
amino acids of particular interest other than glycine and proline are alainine, 
10 isoleucine, leucine, valine, serine, threonine, asparagine, glutamine. 

arginine, aspartic acid, glutamic acid, histidme. By known procedures after 
production of the protein, one or more prolines may be oxidized to hydroxyproiine. 

Also of interest are the polypeptides which have as a repeating unh K-L-(l)- 
L-A-E-A where 1 is a basic or acidic amino acid, particularly K or E and the 
15 repeating units alternate as to whether 1 is a basic or acidic amino acid. This 
strucmre is commonly found in keratin. 

The copolymer involving repeating units is a powerful method for varying 
properties, by appropriate choice of the different units, the number of units in each 
multimer, the spacing between them, and the number of repeats of the muliimer 
20 combination assembly. Thus, by varying the number and arrangement of primary 
monomers, a variety of different physical and chemical properties can be achieved. 

Exemplary of the use of the block copolymers are combinations of silk units 
and elastin units to provide products having properties distinctive from polymers 
only having the same monomeric unit. 
25 The repetitive proteins can find a variety of uses. The Sip proteins may be 

used in producing fibers having unique properties, as a substitute for uilk, and the 
like. Collagen proteins can be produced, where the collagen is free of the 
telopeptide or contains the telopeptide, depending upon its function. Atelopeptide 
collagen should have little if any immunogenicity, so as to be a useful! structural 
30 element for a variety of prosthetic devices or for use as a collagen substiftite in other 
applications. Similarly, other proteins having repetitive sequences, such as keratin, 
can also be prepared in accordance with the subject invention. Other useful 
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repetitive proteins can be prepared based on sequences of spider silks and other 
repetitive animal fibers. Artificial peptides useful for immunization could also be 
prepared based on repeating sequences present in various surface antigens of disease- 
causing microorganisms, such as parasites, bacteria, and viruses. 
5 The following examples are offered by way of illusuration and not by way of 

limitation. 



Examplg I 

DNA Preparation Methods 
10 1. Preparation of plasmid DNA from F. coli: 

A. Small scale: Plasmid DNA was prepared from 1.5 ml cultures by 
either the boiling procedure or the alkaline lysis method (Maniatis, et al.. Molecular 
Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring 
Harbor. (1982)). 

15 B. Large scale: A plasmid-carrying strain was grown overnight in 1 liter 

of Luria broth with the appropriate antibiotic. The cells were collected by 
cenirifugation at I0,000xg for 5 min and resuspended in 10 ml of ice cold TE 
(lOmM Tris-HCl pH 8, ImM EDTA). The cells were centrifuged again, 
resuspended in 4 ml of TES (TE and 25% w/v sucrose) and homogenized by 

20 vonexing. The samples were kept on ice for the following steps. Lyso-zyme (1 ml 
of 10 mg/ml) was added to the cell suspension and incubated for 5 min before the 
addition of 2 ml of 0.5 M EDTA pH 8. After 10 min incubation, 50 ml of 
proteinase K (40 mg/ml) were added followed 10 min later with 15 ml of lysing 
buffer (0.1 % Triton X-100, ImM EDTA, 50 mM tris-Hcl pH 8). Aftex 15-20 min, 

25 the cell lysate was centrifuged at 35,000xg for 90-120 minutes. The supernatant 
(19.8 ml) was transferred to a plastic tube with 20 mg of CsCl and 400 /xl of 
eihidium bromide (10 mg/ml). After dissolution, the mixture was divided into two 
polyallomer ultracentriftige tubes, sealed with heat and centrifuged in a Beckman Ti 
65 rotor at 60.000 rpm for 24 hr. The lower plasmid DNA band was lemoved from 

30 the tube with a hypodermic needle. The ethidium bromide was extracted three times 
with an equal volume of NaCl-samrated isopropanol. Two volumes of H.O were 
added to the DNA solution, and then the DNA was precipitated with eiiianol. 
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2. pntpaprion of double-stranded DNA: 

A culture of JM103 was grown to an OD«x)Of about 0.2 and then divided 
into aliquots of 2 ml. Each aliquot was infected with a fresh plaque of M13 and 
incubated at 37'C for about 6 hours with vigorous shaking. Then the cells were 
5 pelleted and the supernatant was saved for subsequent infections. The double- 
sn-anded phage DNA was extracted by the boiling method (Maniatis et al.). 

3. T>pprnteinization: 

Phenol extraction was perfonned on a convenient volume of DNA sample, 
10 typically between 100 to 10 ml. The DNA sample was diluted in O.OIM Tris- 
HCl pH 7.5, ImM EDTA and an equal volume of water-saturated phenol was 
added. The sample was vortexed briefly and placed on ice for 3 minutes. After 
centrifiigation for 3 min in a microfuge, the aqueous layer was removed to a new 
tube and extracted once with an equal volume of chloroform risoamylalcohol (24:1). 

15 

4. Frth^HP^ precipitation: 

DNA in an aqueous buffer was concentrated by ethanol precipitation. To the 
DNA sample was added 1/10 volume of 3 M sodium acetate pH 7.5 and 2-3 
volumes of cold ethanol. The DNA was precipitated for 30 min at -70e>C or 
20 overnight at -20'C and then pelleted by centriftigation in the microfiige for 15 min 
at 4oC. The pellet was washed once with 200 fil of cold 80% ethanol and pelleted 
again for 10 min at 4oC. After air drying or lyophilization, the pellet:; were 
resuspended in the appropriate buffer. 

25 5. Phnsphatase trefltment of DNA: 

A. Phosphatase treatment of DNA was performed by adding 1 til (25 
units) of calf intestinal phosphatase (Boehringer Mannheim) directly to the 
restriction enzyme digestion reaction and continuing the incubation for 30 minutes at 
37oC. The phosphatase was inactivated for 60 min at 65*'C prior to d.eproteinization 

30 by phenol extraction. 

B. Phosphatase ireaiment of DNA was also performed by resuspending 
ethanol precipitated DNA from the restriction enzyme digest in 20 mlvl Tris-HCl pH 
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8.0. 10 inM MgClj to a final DNA concentration of 20 tig/ml Shrimp alkaline 
phosphause ("SAP") was added at 2 UZ/xg of DNA and the mixture was incubated at 
37*C for one hour, heat inactivated for 20 min at 65X and then passed ilirough a 
Probind filter (Millipore) and subsequently a Bio-Spin column. The DNA was then 
5 ethanol precipitated and resuspended in suitable buffer. 

6. PhOPhO^'ation of DNA: 

Phosphorylation before annealing was performed by using Polynu:leoiide 
Kinase 3'-phosphatase-free (Boerhinger Mannheim). The reaction was carried out at 
10 3TC for 30 min in a 50 ^1 reaction volume containing: 12.5 Mg DNA, 5 ^1 lOx 
kinase buffer (0.5 M Tris pH 7.5, 10 mM Spermidine, 0.1 M MgCl^, 150 mM 
DTT, 1 mM EDTA), and 2 ^1 Polynucleotide Kinase (10 U//il). After 
phosphorylation, salts and glycerol were removed from the DNA strands using a 
Bio-Spin 6 column (BioRad) equilibriated in TEAB. 

15 

7. Fill-in reaction with DNA polymerase I; 

DNA was resuspended in buffer containing 50 mM Tris-HCl pH 7.4. 50 mM 
KCl. 5mM MgClj, and 400 /xM each of the four deoxynucleotide triphosphates. 
Ten units of Klenow DNA polymerase (BRL) were added, and the reaction was 
20 allowed to proceed for 15 min at room temperature. The DNA was then phenol 
extracted and ethanol precipitated. 

8. T4 polynucleotide kinase reaction: 

The reaction (10 ^0 contained: T4 polynucleotide kinase (BRL) , 150 ng of 
25 DNA, 1 ^1 of 10 X kinase buffer (0.7 M Tris-HCl pH 7.6, O.IM MgClj, 50 mM 
DTT) and [^^P]-ATP (200-300 nCi). This was incubated at 37oC for 20 minutes and 
then the DNA was purified using a NACS column (Bethesda Research I-abs). 

9. Digestion with restrictio n ^nHnnucleases: 

30 DNA was digested with restriction endonucleases (REN) in 1 x 'AA** buffer 

(10 X AA buffer is 330 mM Tris-acetate, pH 7.9, 660 mM potassium acetate, 100 
mM magnesium acetate, 50 mM dithiothreitol (DTT) and 1 mg/ml bov ine serum 
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albumin (nuclease free)]. Whenever possible, the concentration of DNfA was kept 
below 1 fig/25 fjil Incubation was at 3VC for 1-4 hrs for most restriction 
endonucleases except for Ball, Banl and Nael digestions which were incubated 
overnight. 

5 

10. Analytical agarose gel ele ctrophoresis of DNA: 

To DNA samples for gel analysis we added 0.2 volumes of loading buffer (5 
X electrophoresis buffer, 0.01 % bromphenol blue dye. 50 mM EDTA, and 50% 
glycerol). Then the samples were loaded into lanes of a horizontal submerged 

10 electrophoresis unit containing a 1.0% (w/v) agarose gel. The electroiphoresis buffer 
was either 1 x TAC or 1/2 x TEE. The 1 x TAC is 40 mM Tris-base, 10 mM 
EDTA, adjusted to pH 7.8 with acetic acid. The 1/2 x TEE is 0.045M Tris-base, 
0.045M boric acid, ImM EDTA, pH 8. The gel was run at 40-50V for 18 hr, then 
removed and stained with 0.5 ^tg/ml ethidium bromide for 30 minutes. The DNA 

15 bands were visualized on a long wavelength UV transilluminator. 

11. Preparative agarose gel electrophoresis: 

The procedures and materials are the same as for the analytical agarose gel 
electrophoresis. The only difference is the use of low melting point ("LMP") 
20 agarose, ranging in concentration from 0.5 to 2.5% (w/v) depending (m the size of 
the DNA fragment to be purified. DNA restriction fragments were excised from the 
LMP agarose gels after visualization with ethidium bromide. For agarose ligation, 
the buffer used was Ix TAE (50 mM Tris-acetate, pH 7.8). 

25 12. NACS purification: 

Gel fragments containing DNA were melted at 70**C for 5 min and diluted 
approximately 5 fold with TEl (10 mM Tris-HCl pH 7.5, 0.2M NaCl). The gel 
solution was applied to a NACS column (BRL). The colunm was washed with 5 ml 
of the same buffer. The bound DNA was eluted with 300 >il of either TE2 (10 mM 

30 Tris-HCl pH 7.5, 1.0 M NaCl) for DNA fragments smaller than lOOO bp or TE3 
(10 mM Tris-HCl pH 7.5. 2M NaCl) for larger fragments. The elute<i DNA was 
concentrated by ethanol precipitation. 
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13. DNA ligation: 

Reactions for ligating cohesive ends contained: 1 DNA, 1 x AA buffer 
(see step 9. above) 1 mM ATP and 20 units of T4 DNA ligase (BRL) in a 20 /il 
final reaction volume. The ligation was allowed to proceed for 16-18 hr at 15 oC or 
5 1-2 hr at room temperature. For blunt-ended ligations the reactions contained 1 
DNA, 25 mM Tris-KCl pH 7.5, 5 mM MgQj, 5 mM DTT, 0.25 mM spermidine, 
200 mg BSA, 1 mM hexamine cobalt chloride (HCC). 0.5mM ATP and 400 units 
T4 DNA ligase (NEB) in a 20 m1 reaction volume. The ligation was alloived to 
proceed for 30 min to 1 hr at room temperature. 

10 

14. Agarnfie DNA Ligation 

The agarose was melted at 65 the temperanire was then lowered to 37 °C 
and ligation buffer (5x = 100 mM Tris-HCl, pH 7.5, 50 mM MgClj, 50 mM DTT, 
1 mM ATP) was added; the tube was then placed at room temperanire and ligase 
15 was added (1000 units T4 DNA ligase (NEB)), the reaction volume was usually 50 
ptl. The reaction was incubated at 15**C for 16-18 hours. 

15. Use nf Filters and Columns for DNA Purification. 

A. Ultrafree®-Probind filter unit ("Probind", Millipore): the DNA 
20 containing solution was applied to the filter unit and spun at 12,000 RPM for 30 

seconds in a Sorvall Microspin 24S. 

B. Microcon-30 filter (Amicon): the DNA containing solution was 
washed by applying to the filter and exchanging twice with HjO by spinning at 
12,000 RPM for 6 min in a microfuge. 

25 C. Bio-Spin 6 column ("Bio-Spin", BioRad): Salts and glycerol were 

removed from the DNA solution by applying to the column, previously equilibrated 
in TEAB (triethyl ammonium bicarbonate pH 7.0), and spinning in a Sorvall RC5B 
centrifuge using an HB4 rotor at 2,500 RPM for 4 min. 

30 16. Agarose DNA Purification Using T Jltraf^e»-Mr Filter Unit: 

This procedure can be used for agarose slices up to 400 ^1 in size. After 
agarose gel electrophoresis the DNA is visualized by ethidium bromide staining and 
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the agarose block containing the DNA band of interest is excised. The agarose is 
then frozen at -20°C for 1 hour; then quickly thawed at 3TC for 5 mimutes. The 
agarose is then thoroughly macerated. The pieces are then transferred into the 
sample cup of the filter unit and spun at 5,000xg in a standard microftige for 20 
5 mins. The agarose is then resuspended in 200 /xl of Tris-EDTA, or other buffer, 
and incubated at room temperature for 30 mins. to allow for elution ol: additional 
DNA from the gel. The mixture is then centrifuged for an additional 20 min at 
10,000 RPM. The DNA is, at this point, in the filtrate nibe separated from the 
agarose fragments and ready for subsequent DNA manipulations. 

10 

Racterial Transformation Methods 

1. Preparation o f transformation-comnetent F, CoU CCllS: 

A culture of 200 ml of sterile L broth was inoculated with a small loopful of 
£. coli cells. This was incubated with shaking at 37oC until the OD^ was 

15 approximately 0.5. The culnire was placed on ice for 10 min and centrifuged at 
6,000xg for 10 minutes. The cell pellet was resuspended in 100 ml of ice-cold 
O.IM MgCl,, kept on ice for 30-40 min and centrifuged again. The pellet was 
resuspended in 2 ml of ice-cold lOOmM CaClj, transferred to a sterile test tube and 
incubated on ice for 24 hours. The competent cells were then aliquotud and stored 

20 at -70^C. 

2. Transformation of £. coU\ 

An aliquot of frozen competent cells were thawed on ice. To 50 pX of cells 
0.1 to 1 /ig of DNA was added and the mixture was incubated on ice for 30 minutes. 
25 The nibe was removed from ice and placed in a 42oC bath for 2 minutes. L broth 
(1 ml) was added and the transformation mix incubated with shaking ;it the desired 
temperature (usually 30**C or 37oC) for 2 hours. Then one-tenth of thie 
transformation was plated on L broth plates containing the appropriate antibiotic 
and, when necessary, XGAL and IPTG were added, 

30 
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3. DNA transformarion of g. mMlir, 

B. subtilis cells were grown to early stationary phase (change in KLieti units of 
^5% in 15 min). Transformation followed established procedures (Anagnostopoulos 
et al., 1981) (ref. 8). Cells (0.45 ml) were incubated with 1-10 ^% of D1>JA at 37oC 
5 for 80 minutes with shaking, and then plated on TBAB agar plates with an 
appropriate antibiotic. 

4. Ysnlation of plasmid DNA from B. subtilis: 

Plasmid DNA from B. subtilis was obtained by a method similar to the 
10 alkaline-lysis method except that pelleted cells were resuspended in 8 ml of solution 
1 (50m M glucose, 10 mM EDTA. 25 mM Tris-HCl (pH 8.0), 10 mg/ral lysozyme) 
and incubated at room temperature for 30 minutes. Then 16 ml of solution 2 (0.2N 
NaOH, 1 % (w/v) SDS) was added and incubated on ice for 10 minutes. Finally, 12 
ml of 3M potassium acetate (pH 4.8) was added and incubated an additictnal 20 min 
15 on ice. The lysed cells were centrifuged 15 min at 15,(K)0 rpm in a Sorval SS-34 
rotor. The DNA was precipitated by adding an equal volume of isopropyl alcohol 
and centrifuged at 7,000 rpm. The pellet was resuspended in 5 ml of ICtmM Tris- 
HCl (pH 7.5), 1 mM EDTA (TE). The solution was phenol extracted once and 
chloroform extracted. DNA was precipitated with ethanol and resuspended in 3 ml 
20 of TE. The volume was adjusted to 5.2 ml by adding 4.2g CsCl. 400 fil of eihidium 
bromide at 10 mg/ml and TE. The solution was transferred to a Beckman quickseal 
polyallomer centrifuge nibe and centrifuged at 45,000 rpm in a Beckman vti65 rotor 
for 18 hours. 

25 Antibody Pro duction. Protein Chemistry and Electrophoresis of Proteins 
1. Preparation of antibody to artificially synthesized peptides: 

Synthetic peptide of sequence (GAGAGS)8GAAGY was coupled to BSA 
using the glutaraldehyde procedure of Kagen and Glick (1979). The dejiree of 
coupling was monitored using trace amounts of radioactive iodinated synthetic 

30 peptide. 

A peptide of 53 amino acids corresponding to the SlpIII sequence was 
prepared on an Applied Biosystems peptide synthesizer. The yield of this material. 
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which has a molecular weight of 3640 was approximately 0.5 grams. The peptide 
was coupled to bovine serum albumin. The material was sent to Antibodies. Inc. for 
preparation of antibodies in rabbits. Peptide conjugates at a concentration of 
Img/ml in complete Freund's adjuvant were used to inmiunize rabbits at day 0. 

5 Animals were re-injected with antigen in Freund's incomplete adjuvant at day 30 
and titered at day 60. Positive sera was detected using a microtiter RIA. using' the 
synthetic peptide as antigen. Kagen and Glick (1979), in Methods of 
Radioinmiunoassay, Jaffe and Herman (eds.). Academic Press, p 328. Antisera was 
obtained that reacted with synthetic peptides of both the SlpI and SlpIII sequences. 

10 These antisera have been useful for the detection of fusion peptides containing giy- 
ala (SLP) sequences. 

Following the procedure described above an antigen was synthesized having 
the formula (V-P-G-V-Og, which was coupled to keyhole limpet hemocyanin. 
Polyclonal antisera was then prepared as described above which bound to the ELP 

15 peptide. 

Following the same procedure, additional antigens were synthesized having 
the formula YTITVYAVTGRGDSPASSKPISINYC of fibroneciin (the FCB portion) 
and the formula (GAPGAPGSQGAPGLQ)jYMK (a repeat unit of the collagen-like 
protein (CLP) sequence) which were coupled to keyhole limpet hemoc> anin for use 
20 as immunogens. Polyclonal antisera were then prepared as described above which 
bound, respectively, to the FCB peptide, and to the synthetic peptide ol" the CLP 3.7 
sequence and PPAS sequence described below. 

2. Polvacrvlamide gel electrophoresis of proteins: 

25 Approximately 10' £. coli cells from growing culmres were pelileted by 

centrifugation at lO.OOOxg for 5 minutes. The cell pellets were resuspended in 100 
to 500 (xl of 2X sample buffer (100 mM Tris-HCl pH 6.8, 4% SDS, 10% P- 
mercaptoethanol, 60% glycerol or sucrose) and sonicated for 30 sec using a Tekmar 
sonic disruptor. Samples were boiled for approximately 5 min and 20 to 100 fil of 

30 the cell lysates were loaded on an SDS-polyacrylamide gel (7.5 to 16% w/v). The 
gels were prepared following the procedure of Laemmli, Nanire (1970) 227:680- 
685. The proteins in the gels were stained with 2% Coomassie brilliant blue in 10% 
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methanol, 7.5% acetic acid for 1 hr and destained in 10% methanol, 7.5% acetic 
acid overnight. 



3. Protein expr ession analysis: 

5 An overnight culnire which had been grown at 30° C was used to inoculate 50 

ml of the LB media contained in a 250 ml flask. Kanamycin was added at a final 
concentration of 50 ^ig/ml and the culture was incubated with agitation (200 RPM) 
at SO'^C. When the culture reached an ODgoo of 0.8, 40 ml were u-ansfeited to a 
new flask pre warmed at 42**C. And incubated at the same temperature for 

10 approximately 2 hours. The cultures (30**C and 42°C) were chilled on ice and 
OD^ was taken. Cells were collected by centrifiigation and then divided in 1 .0 
OD^oo aliquots and used to perform western analysis using the appropriate 
antibodies. 

15 4. Immunoblotting of proteins in ^rels: 

After protein electrophoresis, one of the flanking glass plates was; removed 
from the polyacrylamide gel. The gel surface was wetted with transfer buffer (25 
mM Tris-HCl, 192 mM glycine, 20% methanol). A piece of nitrocellulose paper 
(Sartorius, SMI 1307) was saturated with transfer buffer and laid on the jel. Air 

20 bubbles between the filter and the gel were removed. The gel and nitrocellulose 
filter were placed in the transfer unit as specified by manufacturer (BioRad). 
Transfer was allowed to proceed at 200 mA for 3-4 hours. Then the nitrocellulose 
filler was removed and stained with Amido- Schwartz for 3 min (0.05% Amido 
black, 45% deionized HjO. 45% methanol, 10% acetic acid) and destained in H^O. 

25 The filter was incubated for at least 10 min at room temperature in "BU3TT0" (5% 
w/v nonfat dry milk, 50 mM Tris-HCl pH 7.4, 0.9% w/v NaCl, 0.2% w/v sodium 
azide). The filter was placed in serum appropriately diluted (1:50 to 1 ;500) in 
0.5X Blotto (2.5% nonfat dry milk, 50 mM Tris-HCl pK 7.4, 0.9% NaCl, 0.2% 
sodium azide) and was gently agitated for approximately 16 hr at room i:emperamre. 

30 The filter was washed for 1 hr with 5 changes of TSA (50 mM Tris-HCl pH 7.4, 
0.9% NaCl. 0.2% sodium azide). The blot was placed in 15 ml of 0.5X BLOTTO 
solution containing 1x10^ cpm of the *^M-protein A and gently agitated f^r 2 hr at 
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room temperature. The filter was washed for 2 hr with a minimum of 7 changes of 
TSA, rinsed once with deionized HjO and air dried. The blot was covered with 
Saran* wrap and autoradiographed. 

An alternative to the ^^^I-Protein A detection method was also used. This 
5 method relied on a chemiluminescent signal activated by horseradish peroxidase 
(HRP). The chemiluminescent reagents are readily available from several suppliers 
such as Amersham and DuPont NEN. The western blot was prepared jind blocked 
with BLOTTO. A number of methods were used to introduce the HRP reponer 
enzyme including, for example, a hapten/anti-hapten-HRP, a biotinylated 

10 antibody /streptavidin-HRP» a secondary reporter such as a goat or mouse anti-rabbit 
IgG-biotinylated/streptavidin-HRP, or a goat or mouse-ami rabbit IgG HRP. These 
reagents were bought from different sources such as BioRad or Amersliam and 
occasionally biotinylated antibodies were prepared in our laboratory using Biotin 
NHS from Vector Laboratories, Buriingame, CA. (Cat. #SP-1200) following the 

15 procedure accompanying the product. The following is an example of a procedure 
used to detect the expression of protein polymers. 

The blot was placed in 15 ml of BLOTTO solution containing biotinylated 
goat anti-rabbit IgG (BioRad) diluted in BLOTTO (1:7500) and gently agitated for 2 
hrs at room temperature. The filter was then washed for 30 minutes wiith 3 changes 

20 of TSA (50 mM Tris-HCI pH 7.4, 0.9% NaCl, 0.2% sodium azide). The blot was 
then incubated for 20 minutes at room temperature with gentle rotation, in 20 ml of 
TBS (100 mM Tris Base, 150 mM NaCl, pH 7.5) HRP-Streptavidin (iVmersham) 
diluted 1:10(K) in TBS with 0.1% Tween 20. The blot was then washed three times 
for 5 minutes each in TBS with 0.3% Tween 20 and then three times for 5 minutes 

25 each in TBS with 0.1% Tween 20. The blot was then incubated for 1 minute with 
gentle agitation in 12 ml of development solutions #1 an #2 (Amersham) equally 
mixed. The blot was removed from the development solution and autoradiographed. 



5. Amino Acid Analysis: 
30 Amino acid compositions are determined by the PTC derivitization procedure 

of Henrickson and Meredith (1984). Protein samples were hydrolysed with 5.7 N 
constant boiling KCl at 108oC for 24 hours in vacuo. After reaction \^^ith PITC, 
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amino acid derivatives were detected at 254 ran by HPLC reverse phase 
chromatography using a Hewlett Packard 1090 or Waters 600E system and a 
Supelco CIS column (4.6 nun x 25 cm) with a linear gradient of 0-50% acetonitrile 
in 0.1 M NH4OAC pH 6.78 as a mobile base. Henrickson, R.L. and Meredith, S C. 
5 (1984) Amino Analysis by Reverse Phase High Performance Liquid 
Chromatography. Anal. Biochem. 137:65-74. 

6. Amino Acid Sequence Analvsis: 

The N-terminal amino acid sequence was determined by automatsd Edman 
10 degradation using an Applied Biosystems Model 470A gas phase protein sequenator. 
The PTH amino acid derivatives were analyzed by reverse phase HPLC using a 
Hewlett Packard 1090 or Waters 600E system and an Altex C18 column (2 mm x 25 
cm) with a complex gradient buffer system. 

15 7. Peptide Synthesis: 

Synthetic peptides were prepared by solid phase synthesis on an Applied 
Biosystems Model 430A Peptide Synthesizer using the standard symmetric anhydride 
chemistry as provided by the manufacturer. The coupling yield at each step was 
detennined by the quantiuiive ninhydrin procedure of Sarin et al., (1981). The 

20 synthetic peptide was cleaved from the solid support and amino acid blocking groups 
were removed using anhydrous HF (Stewart and Young, 1984). Crude peptides 
were desalted by chromatography over Sephadex G-50. Sarin, V.K., K^ent. S.B.H., 
Tarn, J. P. and Merrifield, R.B. (1981). AnaL Biochem, 237:927-936. Stewart, 
J.M. and Young. J.D. (1984). Solid Phase Peptide Synthesis, Pierce Chemical 

25 Company, Rockford, IL. pp 85-89. 

Synthetic DNA Methods 

1. In vitro PNA ??vmh€sis; 

The N,N-diisopropylphosphoramidites or p-cyanoethylphosphoramidites, 
30 controUed-pore glass columns and all synthesis reagents were obtained from Applied 
Biosystems, Foster City, California. 
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Synthetic oligonucleotides were prepared by the phosphite iriestcr method 
with an Applied Biosystems Model 380A or 381 A DNA synthesizer using a 10-fold 
excess of protected phosphoramidites and 0.2 or 1 ^mole of nucleotide bound to the 
synthesis support column. The chemistries used for synthesis are the standard 
5 protocols recommended for use with the synthesizer and have been described 
(Matteucci, et al.. Journal Amer. Chem. Soc, . 103:3185-3319 (1981)). 
Deprotection and cleavage of the oligomers from the solid support were performed 
according to standard procedures as described by McBride, et aL, Tetrahedron 
Letters, 24:245-248 (1983) and as provided by Applied Biosystems. The repetitive 
10 yield of the synthesis as measured by the optical density of the removed protecting 
group as recommended by Applied Biosystems (1984) was greater than. 97.5%. 

The crude oligonucleotide mixture was purified by preparative gel 
electrophoresis as described by the Applied Biosystems protocols of November 9, 
1984 (User Bulletin No. 13) and as updated in the Applied Biosystems protocols in 
15 Evaluating and Isolating Synthetic Oligonucleotides, 1992. The acrylamide gel 
concentration varied from 10 to 20% depending upon the length of the oligomer. 
The purified oligomer was identified by UV shadowing, excised from the gel and 
extracted by the crush and soak procedure (Smith, Methods in Enzymology, 65:371- 
379 (1980)). 

20 For DNA synthesis of oligonucleotides longer than 100 bases, the synthesis 

cycle was changed from the protocol recommended by Applied Biosystems for the 
381 A DNA synthesizer. All the reagents used were fresh. All the reagents were 
supplied by Applied Biosystems except for the acetonitrile (Burdick and Jackson 
Cat#017-4 with water content less then 0.001 %) and the 2000 A por: size column 

25 (Glen Research). Due to the length of the oligo. interrupt pauses had lo be inserted 
during the synthesis to allow changing the reagent bottles that emptied during 
synthesis. This interrupt pause was done at the cycle entry step and ths pause was 
kept as short as possible. The washes after deuitylation by TCA, through the 
beginning of each synthesis cycle, were increased from about 2x to 3>: over the 

30 recommended time. The time allocated for the capping was also increased to limit 
truncated failure sequences. After the synthesis the deprotection was clone at 55 X 
for 6 hours. After desalting the synthesized DNA was amplified using PCR. 
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2. Sequencing of PNA; 

DNA sequences were determined by the following methods. Fragments 
containing the region of interest were cloned into the multiple cloning site of 
M13mpl8 or M13mpl9 and single-stranded DNA was prepared and sequenced by 

5 the primer extension method as described in the literature. (Sanger et al. 1977; 
Maniatis et aL, 1982; Norrander et aL 1983. Q£ii£,2fi:10M06 ; Sanger et ai 1977 
Proc Nad. Acad. Sci. USA. 74:5463-5467 and Biggin 1983 PrOP, Natl. Acad. 
Sci.USA. aQ:3963-3965; Sanger et aL 1978, FFBS Letters. 87:107-110^ using "S- 
deoxy adenosine 5'(alpha-thio)-triphosphate (New England Nuclear) as label. In 

10 some cases, reverse transcriptase (Molecular Genetics) was used to extend the 

primer, using the dideoxy:deoxynucleoside tri-phosphate ratios utilized by Zagursky 
eial.. Gene Anal. Techn. (1985)2:89-94. 

Deoxyadenosine triphosphate labeled with either ^'P or "S was used in these 
reactions. Compression artifacts which appeared in some G-C rich sequences were 

15 overcome by eliminating deoxyguanosine triphosphate from the G reaction, and 
using deoxyinosine triphosphate (P-L Biochemicals) at a final concentration of 37.5 
;iM instead. In the other mixes, the concentration of dideoxyGTP in the G reaction 
was 0.5 mM. All sequences were run on 6 or 8% polyacrylamide gels containing 8 
M urea (Sanger et al. 1978). Primers used for sequencing were purchased from P-L 

20 Biochemicals. Storage and analysis of data utilized software from both DNAsiar 
and International Biotechnologies, Inc for IBM personal computer and DNA Strider, 
DNA Inspection He or DNAid for Apple Macintosh personal computer. 

3. In vitro muta genesis of cloned DNA: 

25 Plasmid DNA (1 /xg) conuining the sequence to be mutated was <ligested in 

two separate reactions. One reaction contained either one or two restriction 
endonucleases which cleave at sites immediately flanking the region of interest. In 
the second reaction, the DNA was digested with a restriction endonuclease which 
cleaves only once at a site distant from the sequence to be mutated. The DNA 

30 fragments generated in the first reaction were separated by agarose gel 

electrophoresis and the large fragment which lacks the sequence to be mutated was 
excised and purified. DNA from the second reaction, the large fragment of DNA 
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from the first reaction, and a synthetic oiigodeoxynucleotide of 20-30 bases in length 
containing the mutant sequence were mixed in a molar ratio of 1: 1:25(1. The 
mixture was denatured by heating at lOOoC for 3 min in 25 to 100 fi\ of 100 mM 
NaCl, 6.5 mM Tris-HCl pH 7.5, 8 mM MgClj, and 1 mM P-mercaptoethanoI. The 

5 denatured mixture was reannealed by gradually lowering the temperaanre as follows: 
37oC for 30 min, 4oC for 30 min, and O^C for 10 minutes. The reaction was 
supplemented with 0.5 mM deoxyribonucleotide triphosphates, 1 mM ATP, 400 
units of T4 DNA ligase and 5 units of £. coli DNA polymerase large Iragment and 
incubated at 15oC for 12-16 hours. The reaction mixture was then transformed into 

10 E. coli and antibiotic-resistant colonies were selected. 



4. Dideoxv DNA Sequencing of Double Stranded Plasmid DNA: 

Plasmid DNA was prepared as described previously (Preparaticm of plasmid 
DNA from E. coli. Small Scale, Maniatis et al.). Primers were synth<;si2ed using a 
15 DNA synthesizer as described previously, and were annealed to the plasmid DNA 
following the procedure described above for M13 sequencing. The sequencing 
reactions were done using Sequenase (United States Biochemicals) and the 
conditions were as recommended by the supplier. All sequences were run on 
polyacrylamide gels as described above. 

20 

5. PCR amplification: 

The PCR reaction was performed in a 100 fil volume in a Perkin Elmer thin- 
walled Gene Amp™ reaction tube. Approximately 1 /xl of each primer DNA was 
added to Ix PCR buffer (supplied by Perkin Elmer as lOx solution), 200 (iM of each 

25 dNT, 5U AmpliTaq, and several concentrations of the target DNA. /Amplification 
was performed in a Perkin Elmer DNA Thermal cycler model 480 for 30 cycles 
with the following step cycles of 12 min each: 95*'C, 62°C, and 72**C. Aliquots 
from the different reactions were analyzed by Agarose Gel Electrophoresis using 
1.5% Low Melting Point agarose in 0.5x Ta buffer. The reaction mixtures that 

30 gave the desired band were pooled and spun through an Ultrafree-Probind filter unit 
(Millipore) at 12,000 Rpm for 30 seconds in a Sorvall Microspin 24S to remove the 
AmpHTaq enzyme. The buffer was then exchanged with H2O two times, using a 
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Microcon-30 filter (Amicon) by spinning at 12,000 RPM for 6 min in a microftige. 
Salts and glycerol were removed from the amplified dsDNA using a Bio-Spin 6 
column (from BioRad) equilibrated in TEAB, in a Sorvall RC5B centrifuge using an 
HB4 rotor at 2,500 RPM for 4 minutes. The DNA was then concentrated in vacuo. 

5 

ip^pngntation Conditions 

The fermentor is a 15 L Chemap. 10 L working volume. The culture 
conditions are: temperature = 30''C, pH 6.8; NaOH 2.5 M is used for pH 
regulation. The headspace pressure is below 0.1 bar. The dissolved oxygen is 
10 regulated at 50%. The air flow varies from 0.5 L/min to 20 L/minute. llie 

agitation rate varies between 200 to 1500 rpm. The fermentor is inoculated with a 
10% (v/v) inoculum grown in medium A for 15 hours at 30**C under agitation. 

Medium B, C or D was the fermentor medium. The starting volume in the 
case of 10 liter fermentation, is no less than 3 L, and in the case of a 1 liter 
15 fermentation, is no less than 0.5 liters. 

If the fermentor staning volume is less than the final volume desired, then 
when the carbon source concentration reaches 1 %, a concentrated solution (5x) of 
medium B,C, or D, respectively, is added to the fermentor in order to keep the 
carbon source concentration approximately 1%. 
20 When the culmre reached an OD^oo of 60.0, the lemperamre was increased to 

42oC for 10 min, then lowered to 39 or 40°C for 2 - 3 hours. The cells were then 
harvested by centrifugation and, if necessary, frozen at -VOX until processed. 

Other fermentors used for the expression of protein polymers were usually a 
15 1 MBR, 10 1 working volume, or a 13 I Braun Biostat E, 8.5 I working volume. 
25 The choice of the fermentor and its size is not critical. Any media used fDr the 
growth of E, coli can be used. The nitrogen source ranged from NZAmiiie to 
inorganic salts and the carbon source generally used was glycerol or glucose. All 
fermentations were done with the appropriate selection conditions imposed by the 
plasmid requirements (e.g. kanamycin, ampicillin, etc.). The fermentation method 
30 used to express protein polymers in E. coli was the fed-batch method. Tliis is the 
preferred method for the fermentation of recombinant organisms even if other 
methods can be used. 
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The fed-batch method exploits the stage of cell growth where the organisms 
make a transition from exponential to stationary phase. This transition is often the 
result of either depletion of an essential nutrient or accumulation of a metabolic 
byproduct. When the transition is the result of nutrient depletion, the addition of 

5 nutrients to the system causes cell division to continue. One or more essential 
nutrients can incrementally be added to the fermentation vessel during the run. with 
the net volume increasing during the fermentation process. The resuk is a controlled 
growth rate where biomass and expression levels can be optimized. When the cell 
number in the culture has reached or is approaching a maximum, protein polymer 

10 production is induced by providing an appropriate physical or chemical signal. 

depending upon the expression system used. Production will then coni:inue until the 
accumulated product reaches maximum levels (Fiestchko, J., and Ritch, T., Chem. 
Eng. Commun. (1986). 45:229-240. Seo. J.H.; Bailey, J.E., Bioiechnol. Bioeng. 
(1986). 28:1590-1594. 

15 

Tflhlp 1 • Medinm Table 

M^Hium A- I R Medium 
Constiment 
NaCl 
20 tryptone 
yeast extract 



10 
10 

5 



kanamycin ^^^^ 
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Medium B 

rnnwinient SZL 

NH4CI 
KH,P04 

5 MgS04-7HiO 018 

K,SO, 009 

CaCl, 24x10' 

FeSO,- 7HiO 7.6x10-' 

7E 0-5 ml 

10 casamino acids 25 

yeast extract ^ 

glucose 20 

kanatnycin 5x10' 

15 Medium D 

Consituent SH 

(NHJSO* 5.6 

K,HP04 6.7 

MgSO,- 7HiO 7.8 

20 NaH:P04H:0 3.8 

EDTA 0.98 

Trace Elements 1 "^1 

Yeast Extract or NZ Amine 50 

Glucose or glycerol 20 

25 Kanamycin or ampicillin 5 x 10 ' 

pxample 2 

Assemhlv and Ex pression of the SlpI Gene 
1. Summary of tht- sr.heme for assemhling the SIdI gene: 
30 An 18 bp DNA sequence that codes for the most frequent repeating 

oligopeptide in the silk fibroin protein made by Bombyx mori [Lucas, F . and K.M. 

Rudall (1986) Extracellular Fibrous Proteins: The Silks, p. 475-558. in 
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Comprehensive Biochemistry, vol. 26, part B., M. Florkin and F.H. Stotz (eds.) 
Elsevier, Amsterdam] was synthesized in vitro. Two single-strands were 
synthesized, annealed together and then the resulting double-stranded sediments were 
multimerized head-to-tail to generate concatamers of up to and exceeding 13 repeats. 
5 The structural gene for silk I that we proceeded to work with had 13 repeats that 
coded for the oligopeptide GAGAGS, where g - glycine, a = alanine and 
s = serine. We refer to this strucmral gene as the "monomer". We constructed 
''dimeric, trimeric. tetrameric, pentameric and hexameric" SlpI genes containing 26 
(SlpI-2), 39 (SlpI-3), 52 (SlpM), 65 (SlpI-5) and 78 (SlpI-6) repeats. Ihere is a 
10 shon intervening sequence between each monomer unit. The assembly is pictured as 
follows: 

Repeating DNA Sequence 5'-GGTGCGGGCGCAGGAAGT 

CGCCCGCGTCCTTCACCA-5 ' 

15 



*Monomer^ 



Moltimers 



2. Assembly of the "monomgric" SlpI structural gcng; 

The two single-strands shown above were synthesized as previously 

20 described. The strands were separately purified by gel electrophoresis, 

phosphorylated using T4 polynucleotide kinase and then mixed together and allowed 
to anneal. This resulted in the double-stranded segments aligning spontaneously 
head-to- tail in long concatamers. The phosphodiester bonds between sejjments were 
formed with T4 DNA ligase. The reaction was stopped by filling in the terminal 

25 cohesive ends using the Klenow fragment of DNA polymerase I. The bilunt-ended 
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repeating DNA was then ligated to the Hindi REN site in plasmid vector pUC12 
(Veiera, et al.. Gene 19:259-268 (1982)). The ligated DNA was transformed into 
E, colt HBlOl and transformants were selected for their ability to grow in the 
presence of ampicillin. The DNA of potential clones was analyzed; for uize and 

5 orientation by REN digestion and gel electrophoresis. DNA sequences \/ere 
determined for isolates with large inserts that were oriented properly. Trie 
"monomer" clone selected for subsequent multimerization had 13 repeatij coding for 
the oligopeptide AGAGSG, and was named pSY708. The DNA sequence, deduced 
amino acid sequence and REN sites of the SlpI insert and flanking regions of 

10 pSY708 are shown in Table 2. 

Tablg 2 

H PAS 
I S V M 

N T A A 

15 3 111 

I III 

I III 

AAGCTTGGGCTGCAGGTCACCCGGGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGT 
TTCGAACCCGACGTCCAGTGGGCCCGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCA 60 
20 KLGLQVTRAGAGSGAGAGSG 

GCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGC 
CGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCG 120 
AGAGSGAGAGSGAGAGSGAG 



25 



GCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGA 
CGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCT 190 
AGSGAGAGSGAGAGSGAGAG 



30 AGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGTGCGGGCGCAGGAAGTGGT 

TCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCACGCCCGCGTCCTTCACCA 240 
SGAGAGSGAGAGSGAGAGSG 

35 

X B A S E 

B A V M C 

A M A A R 

1111 1 
40 till I 

I I I I I 

GCGGGCGCAGGAAGTGGGACTCTAGAGGATCCCCGGGCGAGCTCGAATTC 

CGCCCGCGTCCTTCACCCTGAGATCTCCTAGGGGCCCGCTCGAGCTTAAG 290 
AGAGSGTLODPRASSNS 

45 



3. ronstniction of the expression vector. PSY7Q1: 

Plasmid pSP65 (10 /xg, Boehringer Mannheim) was digested with Aatll 
REN, phenol extracted and ethanol precipitated. The DNA was resuspsnded in 10/xl 
50 of HjO. One-half of this DNA was digested with exonuclease III in the following 
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mix: 5 DNA, 10 ^il lOX exonuclease III buffer (600 mM Tris-HCl pH 8.0, 6.6 
mM MgClj, 10 mM P-mercaptoethanol) and 9 units of exonuclease III in a loul 
volume of 200 plI, Samples of 20 ;xl were taken at 0, 1 , 2.5, 5 and 7. 5 min and 
diluted immediately in 100 ^1 of the following buffer (30 mM sodiunii acetate, pH 

5 4.5, 0.25 M NaCl. 1 mM ZnS04) containing 5 fig iRNA and 36 units of SI 

nuclease. Incubation was at 30oC for 45 min and then the reaction was temiinated 
by the addition of 15 fil of stop buffer (0.5 M Tris pH 9.0. 125 mM EDTA, 1% 
w/v SDS, 200 /xg/ml tRNA). The samples were phenol extracted and ethanol 
precipitated. The resuspended DNA was digested with Smal REN and 

10 elecirophoresed through a 1 % gel of low melting point agarose. The gel band 
corresponding to the DNA fragment carrying the P-lactamase gene, the plasmid 
origin and the P-galactosidase promoter was excised from the gel and melted at 
65oC. One volume of H.O was added. The DNA in each sample (timepoint) was 
recircularized by ligation in the presence of agarose. The reaction included 8 ^1 

15 melted gel, 2 ^1 of ligauon buffer (100 mM Tris-HCl pH 7,5, 50 mM MgCL, 50 
mM DTT, 1 mM ATP), 10 units T4 DNA ligase and was incubated ;« 15oC for 3 
hours. Competent cells of JMlOl were transformed with the ligated DNA and 
transformants were selected by growth on L broth plates containing ampicillin (40 
^g/ml). Plasmid DNA was prepared from four transformants. The DNA was 

20 digested with BamHI REN, labeled with ^^P-dGTP using the Klenow fragment of 
DNA Polymerase I, digested with Pvu I and then the smallest fragmisnt was gel 
purified. The fragment from one iransformani was sequenced using the Maxam and 
Gilbert technique. The fragments of the other three plasmids were farther digested 
with TaqI and electrophoresed on the same gel. The sequenced plasmid had a fusion 

25 between the multiple cloning site and a position upstream from the hf-terminal ATG 
of p-lactamase. The size of the BamHI-TaqI fragment of two of the other plasmids 
indicated a fusion between the multiple cloning site and the 4th amino acid of the P- 
lactamase gene. The DNA and corresponding amino acid sequences of the N- 
terminal region of the altered p-lactamase are given below, along with a circular 

30 map of REN sites for pSY701 arc shown in Fig. 1 . The amino acid sequence of 
Fig. 1 is met-thr-met-ile-thr-pro-ser-leu-gly-cys-arg-ser-thr-leu-glu-2isp-pro-his-phe- 
arg-val-ala-leu-ile-pro-phe-phe-ala-ala-phe-cys-leu-pro-val-phe-ala-his. 
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4. Insertinp nf "monomer" SlpT From pSYTOR imn pSYTOl: 

Plasmid pSYTOS was digested with Hindlll, the cohesive ends were filled in 
using the Klenow fragment of DNA polymerase I and then digested with BamHI. 
Plasmid pSY701 was digested with Xbal, filled in as above and then digested with 
5 BamHI. The DNA fragment from pSY708 and the backbone of pSY701 were then 
purified by electrophoresis through a low melting temperature agarose gel and 
purified with NACS (BRL) columns. The appropriate fragments were mixed, 
ligated, and then transformed into £. coli JM109. Transformed cells were selected 
by growth on L plates containing ampicillin (40 mg/ml), IPTG (5x10"^ M) and 

10 XGAL (20 mg/ml). Transformants were analyzed for plasmid contents and one 
(pSY756) was selected for further study since it carried the insert of the monomer 
SIpI-1 sequences in the proper orientation, as determined by mapping of ElEN sites. 
Although the entire DNA sequence was not determined for pSY756, the junctions 
between the insen and vector were verified as correct restriction sequences for Xbal, 

15 upstream and BamHI, downstream. 

5. Multimerization of the SlpI pene of pSY756: 

Plasmid pSY708 was digested with the REN Smal and the DNA fragment 
carr>'ing the coding sequence for the polypeptide arg( ala-gly-ala-gly-ser-gly),, thr- 

20 leu-glu-asp-pro (R(AGAGSG)i3TLEDP) was purified as in 4 above. Plasmid 
pSY756 was digested with Smal, deproteinized and then ligated with the purified 
DNA fragment from pSY708. Transformants of £. coli JM109 were selected on 
medium containing ampicillin. Clones were found to contain 2 units (diraer 
pSY882), 3 units (trimer pSY883), and 4 units (tetramer pSY915) of the original 

25 monomer sequence of the pSY708 clone. Similarly, pentamers and hexamers have 
also been constructed. All of these plasmids are genetically stable and produce the 
gly-ala peptide as a fusion with P*lactama5e. 

6. Expression of the SlpI gene fusion to the ^-lactamase protein: 

30 Synthesis in E. coli cells of the SlpI peptide as a fusion protein with P- 

lactamase was detected by immunoblotting (Western) analysis. Anti-"Slp" 
antibodies were raised against a synthetic silk peptide. Fusions between p-lactamase 
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and SlpI were also detected with antibodies raised against the £. coli P-lactamase. 
As shown in Figure 2, this antibody reacts with dimers and trimers o f SlpI ftised to 
the £. coli P-lactamase. The SlpI insert precedes the fifth amino acid of the signal 
sequence for this enzyme. The P-lactamase antibody (Figure 2A) delects both the 

5 unprocessed fusion proteins as well as the processed mature enzyme which appears 
as the major antigenic band in this figure, at about the 28 kDal position. The 
mobilities of all Sip-containing polypeptides are anomalously slow ard the proteins 
are not as large as they appear on the gels. 

The anti-Sip antibody also is useful in detecting these fusion products. Lanes 

10 2-5 of Figure 2B represent 4 separate clones diat contain dimer fusions of SlpI with 
P-lactamase, while lanes 6 and 7 are from two clones containing trimer fusions. As 
can be seen the antigenicity of the trimer is considerably greater than for the dimer. 
It is known from prior experiments that fusion proteins containing orJy a monomer 
of SlpI are hot detected at all with this anti-Sip antibody. The increased antigenicity 

15 of the trimer peptide allows it to be detected as a processed fusion with the P- 

lactamase signal peptide. The processed form is seen at about the 33 kDal position , 
in lanes 6 and 7 of Figure 2B. The appearance of normally processed p-lactamase 
mature enzyme (detected with p-lactamase antibody) as well as a peptide 
corresponding to the fusion between the SlpI-3 trimer and the signal peptide of P- 

20 lactamase (detected with gly-ala antibody) suggests that despite the insertion of SlpI 
sequences within the signal sequence, normal proteolytic processing of the enzyme 
occurs in E. coli. 

7. a. Expression of the SlpI gene bv fus ion to T7 genes: 

25 The SlpI sequence has also been expressed as a fusion protein with both the 

gene 9 and gene 10 proteins from bacteriophage T7 in £. coli. The construction is 
diagrammed in Figure 3. Plasmid pSY915 (containing the SlpM tetramer) was 
digested to completion with REN Sail and partially with BamHI. Tfcie DNA 
fragment containing the SlpM tetramer was purified and then cloned in plasmid 

30 pSYl 14 (pG2 of Promega Biotech) which had been digested with REiNs Sail and 
BamHI. From this intermediate plasmid, the tetramer insert of SlpI was removed 
with the RENs AccI and EcoRI. This fragment was then cloned in fSY633 
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(pBR322 containing the complete T7 gene 9 sequence; pAR441 of Studic:r et al., 
(1986)) which was digested with EcoRI and AsuII. In the resulting plasinid the SlpI 
tetramer is fused to the gene 9 translational reading frame near the C-tenninus of 
gene 9. This plasmid was then used to transform E, coli strain 0-48 (stnin HMS174 
5 (XDE3) of Studier, et al. , 1986) which contains the T7 RNA polymerase: gene 

inserted into the chromosome under transcriptional control of the IPTG-:inducible P- 
galactosidase promoter. In this configuration, expression of the SlpI-4 sequence is 
dependent upon production of the T7 RNA polymerase which itself is controlled by 
the IPTG inducible P-galactosidase promoter. As shown in Figure 4B and 4C, when 
10 these cells are induced with IPTG a protein product of the gene 9/SlpI-4 fusion gene 
is synthesized and is detected with antibody to the synthetic Sip peptide. The fusion 
product migrates in the gel as if it was 82 kDal in size. The size expected is only 65 
kdal. The anomalous mobility is characteristic of the unusual amino acid 
composition (rich in glycine and alanine) and is seen for all Slp-comainimg products. 
15 In like manner, plasmid pSY638 (pAR21 13 of Smdier) containing the 

promoter region and the first 13 amino acids of the T7 gene 10 protein, was digested 
with REN BamHI, filled in with the Klenow fragment of DNA polymei-ase and then 
dieested with REN EcoRI. Into this linearized plasmid was cloned the AsuII-EcoRI 
fragment of pSY633, containing the SlpI-4 tetramer. This ligation creates an in- 
20 frame fusion of the silk tetramer following the thirteenth amino acid of T7 gene 10. 
The latter fusion product may be used for spinning without further processing since 
the N-terminal 13 amino acids are only a small part of the large Slpl protein. 
Although the fusion product is about 30 kDal in size, it has an anomalous mobility 
and migrates as if it was larger, 50 kDal. This is shown in Figure 4A. 
25 The plasmids pG9/SlpI-4 and pGlO/SlpM were further improved by 

insening a kanamycin-resistance gene in the P-lactamase gene in the orientation 
opposite to the T7 expression system. Thus, any low level expression from the T7 
system does not lead to elevated P-lactamase activity. Such activity eliminated the 
ampicillin in the medium that was added to select for maintenance of the plasmid. 
30 When the ampicillin was depleted the plasmids were lost from the culture. The 
kanamycin-resistance gene circumvents this problem and represents a tsignificant 
improvement in the T7 expression system, especially for large scale cultures. The 
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kanamycin-resistance gene (originally from Tn903) was isolated from a plasmid 
pUC4K (Veira, J. and J. Messing (1982) Gene 19:259-268) as a HincllI fragment. 
The plasmid containing pGlO/SlpI-4 and the kanamycin resistance gene was 
designated pSY997. 

5 

7.b. Fermentation and purification of SlpI-4: 

£. coli strain 0-48 carrying pSY997 was grown at 37oC, using a Chemap or 

a Braun fermcntor, in 10 L of LB to an OD (Klett units) of 300 (3xl0^ells/ml). 

The T7 system was then induced with the addition of 3.5 mM IPTG. After 150 min 
10 the cells were concentrated lOx using a Millipore fiher unit (Pellicon cassette 

system, 100,000 molecular weight cut off filter). The cell suspension was then 

frozen at -70oC until processing. 

The cell suspension was melted in a water bath at 42oC and lysed in a french 

press, and the lysaie was spun at 125,000xg for 1 hour at 25oC. The cleared 
15 supernatant was treated with DNAase (250 /xni/ml) for 15 min at roorti temperature, 

then filtered through a .45 fxm sterile filter. The filtrate volume was measured and 

incubated in ice with slow stirring. Then 231 mg of ammonium sulphate were 

added for each ml of filtrate over a period of 45 minutes. One ml of NaOH for each 

10 g of anunonium sulphate was added to neutralize the pH. 
20 After 2 hours of continuous stirring the mixttire was spun at 9,000xg for 10 

minutes. The pellet was rcsuspended in 1/10 of the original filtrate volume using 

distilled water. The centrifugation and resuspension was repeated three times. The 

pellet was resuspended in 1/10 of the original filtrate volume in distilled water. 

Samples were analyzed for protein concentration, amino acid composition and 
25 protein sequence by standard methods. This is one of several methods for obtaining 

the product. This method results in a SlpI-4 product that is greater tfcian 90% pure. 

The amino acid composition is almost entirely gly, ala and ser, as exi)ected, and the 

N-terminal amino acid sequence is that of the gene 10 leader. 

30 8. Controlled expression of the T7 RNA polymerase gene in Bacillus subtilis: 
The coding sequence of the T7 RNA polymerase gene (T7 geoe 1, T7 
nucleotides 3128 to 5845) from plasmid pSY558 (pARUSl of Studier. et al., 1986) 
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was modified by in vitro mutagenesis of cloned DNA. We inserted the recognition 
sequence for the restriction endonuciease Ndel at position 3171, Using jin 
oligodeoxynucleoiide which was synthesized as previously described, the: T7 gene 1 
sequence was changed from its natural sequence, TAAATG, to the modified 
5 sequence, CATATG. 

Similarly, the upstream regulatory sequence of the Bacillus subtilis gene 
spoVG, obtained from plasmid pCB1291 (Rosenblum, et al., 7. Bacteriology, 
148:341-351 (1981)). was modified by in vitro mutagenesis at position i5 (Johnson, 
et al.. Nature, 302:800-804 (1983)) such that it also includes an Ndel chavage site. 
10 The upstream regulatory sequences of the spoVG gene were then ligated with the 
coding sequence of the T7 RNA polymerase gene via these novel Ndel cleavage 
sites. After u^nsformation of £. coli HBlOl, the plasmid contents of individual 
ampicillin-resistam isolates were checked by restriction mapping. The correct 
construction was named pSY649. 
15 Plasmid DNA containing the spoVG:T7 RNA polymerase fusion gene 

(pSY649) was further modified to include a chloramphenicol-resistance gene that 
functions in B. subtilis. First the Ndel to Sail fragment of about 1200 t»ase pairs 
from plasmid pGR71-P43 (Goldfarb, et al., Nature. 293:309-311 (1981)) was 
isolated. This fragment carries the P43 promoter of B. subtilis and an adjacent 
20 chloramphenicol acetyltransferase gene from Tn9. After filling in all the cohesive 
ends using the Klenow DNA polymerase reaction, this fragment was inserted into 
the Xbal site within the multiple-cloning site of pUC13 (Veiera, et al.. Gene, 
19:259-268 (1982)). Ampicillin and chloramphenicol-resistant transformants were 
selected for further use. The correct plasmid construction was named pSY630. The 
25 Smal to Hindi endonuciease cleavage fragment from plasmid pSY630 containing 
the chloramphenicol acetyltransferase gene fused to the P43 promoter sequence was 
gel purified and blunt-end ligated to the Pvul site of plasmid pSY649 thiat had been 
treated first with T4 DNA polymerase. The resulting plasmid, pSY856, was then 
transformed into B. subtilis 1168. Because plasmid pSY856 is unable to replicate 
30 autonomously in S. subtilis, stable transformants resistant to chloramphienicol must 
result from the integration of the plasmid into the B, subtilis chromosome (Ferrari, 
ei a!., y. Bacteriology, 154:1513-1515 (1983)). The integration event, facilitated by 
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homologous recombination, most likely occurred at either the spoVG or the P43 loci 
of the bacterial chromosome (pSY856 contains DNA sequences homologous to the 
B. subtilis chromosome at only these two sites). The resulung strain, "IBIPoL/ was 
grown both in the presence and absence of chloramphenicol in order to determine 
5 the stability of the selectable marker. Expression of the T7 polymerase was obtained 
and this has no apparent effect on the growth or viability of this strain. 

9,a. Fypression of a plasmid-bom g Target gene rVanamvcin-rcsiStancc) in B. 
.r^Yfrff'/rr Rtrain BTPoL: 
10 The Staphylococcus aureus plasmid pUBllO (Lacey, el al., J. Med. 

Microbiology, 7:285-297, 1974) which contains the gene coding for resistance to the 
antibiotic kanamycin was used to test the expression of the growth-regulated 
spoVG:T7 RNA polymerase gene of strain BlPoL. An EcoRI-BamHI Iragment of 
phage T7 DNA (positions 21 ,402 to 22,858) containing the T7 gene 9 promoter 
15 sequence was purified from plasmid pAR441 (Studier, et al., 1986). This DNA 
fragment was ligated into pUBllO between the EcoRI and BamHI restriction 
endonuclease sites. The resulting plasmid, pSY952, contains the T7-specific 
promoter in the same orientation as the kanamycin-resistance gene. Plasmid 
pSY952 was transformed into B. subtilis 1 168 and BIPoL and these strains were 
20 analyzed for the level of expression of the polypeptide encoded by the plasmid 

derived kanamycin-resistance gene. Approximately 10' cells from growing culmres 
of 1 168, 1 168 containing pUBl lO, 1 168 containing pSY952, BIPoL, IBIPoL 
containing pUBllO, and BIPoL containing pSY952 were obtained at several times 
during the growth and sporulation cycle. The proteins in these cell samples were 
25 processed and analyzed by polyacrylamide gel electrophoresis. 

Because the rate of transcription from the spoVG promoter increases as a 
function of cell density and reaches a maximum during eariy sporulaticm, an 
accelerated accumulation of the target protein is expected in the BIPoL» strain 
containing pSY952 during growth as the culture enters sporulation. The results 
30 show that a protein of molecular weight 34 kDal increases in abundance as the 
culture approaches and enters stationary phase. The size of the protein is in 
agreement with the predicted size of the kanamycin-resistance gene product (Sadaie, 
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ei al., 7. Bacteriology, 141:1178-1182 (1980)) encoded in pSY952. This protein is 
not present in BIPoL or 1168 containing pSY952 which lacks the spoVG- regulated 
T7 RNA polymerase gene or in BIPoL containing pUBllO which lacks the T7 
promoter sequence. The maximum accumulated level of target protein after 24 
5 hours of growth in BIPoL containing pSY952 was 20% of the total cellular protein 
as determined by densitometry. 

9.b Expression of SlpI-4 in B. subtilis: 

Plasmid pGlOSlpI was digested with EcoRI REN. After filling in the 
10 cohesive ends using the Klenow DNA polymerase reaction, the DNA was digested 
with Bglll REN. Plasmid pSY662 was digested with Smal and BamHI RENs. The 
two plasmids were then purified by electrophoresis through a low melting 
temperawre agarose gel and purified with NACS (BRL) columns. The E>NA 
fragment of pGlOSlpI was ligated to the backbone of pSY662 and transfc»rmed into 
15 E. coli containing ampicillin (40 /ig/ml). Transformants were analyzed for plasmid 
contents and one (pSY662/G10/SlpM) was selected for further study. 

Competent cells of B, subtilis BIPol were transformed with 
pSY662/G10/SlpI-4 and incubated at SVC with shaking for 90 minutes. The 
transformation mixnire was then diluted 1:100 in fresh LB containing 10 ^ig/ml of 
20 tetracycline and incubated at 37*C with shaking. Samples were taken and equal 
numbers of cells were lysed and loaded on gels for separation by SDS-Pi^GE. 
Immunoblot analysis was performed using anti-Sip antibodies to detect the synthesis 
of the gene 10/SlpM fusion protein. 

The expression of the SlpI-4 polypeptide in fl. subtilis was detected by its 
25 seroreactivity with anti-Sip antibody, after transfer of the cellular proteins from the 
polyacrylamide gel to a nitrocellulose filter. We verified that the seroreactive 
protein was the product of the SIpI-4 gene by exhaustively treating the cellular 
proteins with CNBr. This should cleave after methionine residues, but since SlpI-4 
lacks methionine it will remain intact. The CNBr treatment eliminated g;reater than 
30 98% of the proteins stainable with Coomassie blue dye. And as expected for a 
protein lacking methionine, SlpI-4 remained intact and still reacted with anti-Sip 
serum. 
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Example ^ 

ARRemhlv and Fxpression of The SIdIII Gene 

1. R\}jnm^Tv of the scheme for assembling the; SlnTTT gene: 

The synthetic SlpIII gene codes for a protein similar to the SlpI gene and to 

5 the crystalline region of the silk fibroin protein made by the silkworm, Bombyx 
mori, SlpIII more closely resembles the silk fibroin molecule because it includes the 
amino acid tyrosine at regular intervals (about 50 residues), whereas multimers of 
Slpl do not. The SlpIII gene was assembled from smaller parts. First, three 
doublestranded sections of DN A of about 60 bp in length were chemically 

10 synthesized. Each section was cloned by insertion into bacteriophage H13 and the 
DNA sequence was verified. These sections were then removed from i:he vector and 
linked together in a specific order. This linkage of about 180 bp is named the SlpIII 
"monomer". "Monomers" were then linked in a specific order to yield dimers. 
trimers, tetramers, etc. of SlpIII. The multimers were then cloned either directly 

15 into plasmid expression vectors to detect the SlpIII protein or initially into an adapter 
plasmid. Insertion of the SlpIII DNA into the adapter allows for furthcrr gene 
manipulation and is further described later. The assembly scheme is pictured as 
follows: 



20 2. Synthesis of double-stranded DNA sections 
The assembly Scheme is pictured as follows: 

Section 1 5' 



25 



Section 2 5* 



30 Section 3 



I M I I I I II I I M I II 



I I M M I I I I I i I I I I 



I I I I I M II I I I II I I 



35 Assembly of "monomer*' 
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Multimerization 



123 



123 



123 



The DNA and corresponding amino acid sequences of the three sections .of 
10 the SlpIII gene are shown in Table 3. 
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The double-stranded DNA sequence is shown in the 5' to 3' direction. The 
amino acids (g = glycine, a = alanine, s = serine, y - tyrosine) coded by the 
sequence are shown immediately below each section. Recognition sequences for 
cleavage by restriction endonucleases are shown above each section. 

5 The above six single-strands were synthesized. After synthesis, the strands 

of DNA were purified and the homologous strands were annealed. About 1 ^1 (0-5 
/ig) of each strand was mixed with 2 /il of lOX A A (description) buffer and 16 of 
sterilized deionized HjO in a 1 .5 ml polypropylene Eppendorf tube. Th<5 tube was 
placed in a boiling water bath (500 ml in a 1 liter beaker) for 10 min and than the 

10 beaker was removed from the hot plate and allowed to cool on the bench to room 
temperature. This required about 1-2 hours. 

Each of the three double-stranded sections was cloned separatel> into 
M13mpl8. Section 1 was ligated between the Smai and BamHI restriction sites of 
the multiple-cloning site. Section 2 was ligated between the BamHI and PstI sites. 

15 And section 3 was inserted between the Pstl and Hindlll sites. The respective clones 
are: M13mpl8.1. M13mpl8.2, M13mpl8.3, The DNA sequence was determined 
for each cloned section. One representative of each section that had the correct 
DNA sequence was recovered and became the material for the next step: assembly 
of the ''monomer". 

20 

3. Assemhiv of the "monomer" of SlpIII: 

The DNA sections 2 and 3 were isolated by digestion of the Ml 3 clones with 
restriction enzymes: for section 2, M13mpl8.2 was digested with BamHI and Pstl ; 
for section 3, M13mpl8.3 was digested with Pstl and Hindlll. The two sections 

25 were purified and mixed together in equal molar amounts with M13mpl8.1 that had 
been first digested with BamHI and Hindlll. T4 DNA ligase was added to link the 
homologous overlapping ends in the order 1-2-3. Due to the hybridizadon 
specificity of the cohesive ends, the three sections are efficiently linked in only this 
order. The DNA sequence of the cloned "monomer" in the assembly ramed 

30 M13mp 18. 1.2.3 was determined to be correct and as shown in 2 above. 
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4. MuUimcrizaiion of tti(^ "monomgr" of Slpirr 

In order to prq)are large amounts of the "monomer" structural gene we first 
subcloned the "monomer" into the plasmid vector pUC12. M13mpl8..1.2.3 was 
digested with EcoRI and Hindlll restriction enzymes. The SlpIII "monomer" was 
5 gel purified and iigated into pUC12 digested with EcoRI and Hindlll. The resulting 
plasmid DNA was prepared, the "monomer" was released from the vector by 
digestion with Ban! REN and the fragment was gel purified. 

To create multimers, "monomer" DNA with BanI ends were linked by 
ligation. The nonpalindromic terminal BanI recognition sequence allows linkage 
10 only in a head-to-tail order. The extent of multimerization is monitonid by gel 
electrophoresis and staining the DNA with ethidium bromide. Multin'.iers of more 
than 20 units have been obtained by this method. 

5. Cloning of t he muliimers of SlpIII: 

15 Plasmid pCQV2 (Queen, et al., 7. AppL MoL Gen., 2:1-10 (1983)) was 

digested with EcoRI and BamHI restriction endonucleases and a fragment of about 
900 bp was purified. This DNA fragment contains the bacteriophage lambda cI-857 
repressor gene, the closely linked rightward promoter, Pr» and the beginning of the 
cro gene. Plasmid pSY335 (described as pJF751 in Ferrari, et al., J. Bacteriology, 

20 161: 556-562 (1985)) was digested with EcoRI and BamHI restriction enzymes and 
subsequently Iigated to the DNA fragment of approximately 900 bp of pCQV2. The 
plasmid obtained from this construction, pSY751 , expresses the p-gal,aciosidase 
gene at 37oC and 42oC. but not at 30oC (Figure 8). 

In this approach the SlpIII gene is first cloned into an "adapter" sequence in 

25 an intermediate plasmid and then subcloned to the expression systems. The adapter 
sequence has the following useftil features: a unique central BanI RE^f site, three 
unique REN sites to either side of BanI, information coding for protein cleavage at 
either methionine, aspartate-proline or arginine amino acids and small size. The 
BanI site is the point of insertion for the SlpIII multimers with BanI ends. 

30 The adapter was synthesized with the Applied Biosystems 380A Synthesizer, 

cloned in M13mpl8 and the DNA sequence verified. The adapter was then 
subcloned into a specially-constructed plasmid vector that lacked BanI REN sites. 
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The recipient plasmid was made as follows. Plasmid pJHlOl (Ferrari, et al., 1983) 
was partially digested with Ahalll restriction enzyme and religated. Tramformants 
of £. coli HBlOl were selected on medium containing chloramphenicol (12.S 
mg/ml). After restriction analysis of several isolates one plasmid was chosen. 

5 pSY325 (Figure 7). This plasmid contains only the chloramphenicol-resistance gene 
and the replication origin (from pBR322) of pJHlOl. After digestion to completion 
with XhoII, pSY325 was ligated with the gel-purified adapter. The result was the 
adapier-plasmid, pSY937. The new pSY937 REN sites were verified. 

The SlpIII multimers were cloned into the BanI site of pSY937 (Fiigure 7). 

10 Positive clones were identified by colony hybridization and with the lower strand of 
section 1 of SlpIII as the DNA probe for hybridization (probe sequence shown in 
Table 2). Positive clones were characterized by gel electrophoresis for the size of 
the inserted muliimer. Finally, the SlpIII sequences were subcloned usinc: the REN 
site in the flanking adapter regions to specific locations of expression plasmids. 

IS The SlpIII protein had the following amino acid composition: 

SlpIII 1178 AA MW 83,000 

( f m ) DPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
GAGS (GAGAGS) e GAAGY 
20 t ( GAGAGS ) 9 GAAGY] ^ n 

GAGAGSGAGAGSGAGAMDPGRYQLSAGRYHYQLVWCQK 

(fm) intends the initiation codon 



SlpIII Expression Vector 

25 Plasmid DNA pSY1086 is a pSY937 derivative containing 19 repeals of 

SlpIII (3.5 kb). This plasmid DNA was digested with Nnil and PvuII and the 
fragments separated by agarose gel electrophoresis. The purified SlpIII multimer 
was then cloned in plasmid pSY751 digested with PvuIII REN. Several clones were 
analyzed and one (pSYlOOS) was chosen to be used in expression experiments and 

30 SlpIII purification. 

The ampicillin drug resistance gene of pSYl(X)8 was substimted v/ith the 
kanamycin marker from pSYlOlO (produced by digestion of pSY633 witli Dral and 
Sspl and insertion of Kan*^ obtained by Hindi digestion of pUC4K) and the 
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subsequent plasmid was called pSYl 186. By removing the SlpIII portion of plasmid 
pSY1186 with BanI, a new plasmid, pSY1262, was generated. This plasmid 
contains a unique BanI site which allows for the direct ligation of fragments 
containing BanI ends obtained by polymerization of monomers. This plasmid has 
5 been used to generate plasmids containing inserts for the following proteins: SELPl, 
2, 3, and Slp4. 



Production and Purification of S lpTTY Cell Culture 

E, coli are cultured in the following medium: 
10 Mcdiym C 





ill 


yeast extract 


20 


casamino acids 


20 


peptone 


20 


gelatin peptone 


20 


KHjP04 


2 


KjHPO, 


2 


NajHP047HjO 


2 


glucose 


2 


ampicillin 


0.1 



An overnight culnire (500 ml - 1 1) which had been grown at :lOoC was used 
to inoculate 375 1 of media contained in a 500 1 fennentor. Fermentor conditions 
include a tachometer reading of 100 rpm, vessel back pressure of 5 psii and an air 

25 flow of 170 l/min in order to maintain dissolved O, at greater than 50%. 

Glucose (1 g/1) and ampicillin (0.05 g/1) were added to the fermentation 
when the culture reached an ODgjo of 1.0 and again at 2.0. When the culture 
reached an 0T>^^ of 2.0 the temperature was increased to 42oC for 1(1 minutes and 
then lowered to 38**C for 2 hours. The culture was then chilled to IQoC and cells 

30 were harvested by centrifugation in a continuous centrifuge and frozen at -70oC until 
processed. Yields from two separate fermentations were 7.3 kg and 5.2 kg wet 
weight of cells. 
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It should be noted that other media can be used and, with different plasmids, 
various selection conditions can be imposed (i.e., substitution of kanamycin 
selection for ampicillin). These conditions have been used in laboratory scale 
fermentations (10 I volumes). 

Cell Lysis 

Method 1. Cells were thawed and suspended to a concentration of 1 kg wet 
weight/6 1 in 50 mM Tris-HCl pH 7.0, 1 mM EDTA and broken by 2 passages 
through an APR Gaulin cell disrupter at 80(X) psi. During this lysis procedure the 
cells were kept cold with an ice bath. The cell lysate was then centrifugcd at 
26.000xg with a continuous centrifuge, such as the T2-28 rotor in a Sorvall RC5B 
refrigerated centrifuge operated at 4oC. Under these conditions greater than 90% of 
the SlpIII produced can be found in the pellet. The supernatant did contain some 
product which could be recovered by NH^S04 precipitation as described below. The 
pellet was extracted with LiBr as described below. 

Method 2. Frozen cells were thawed and resuspended to a concentration of 
1 kg wet weight in 6 L in 50 mM Tris-KCI pH 7.0, 10 mM EDTA, and 5 mM 
PMSF to inhibit protease activity. Cells were stirred in this buffer at rocm 
temperature for 0.5 to 2 hours, then lysozyme was added to a concentrat ion of 1 g/I 
and incubation is continued for 20 minutes. P-Mercaptoethanol was then added to 
70 mM and the detergent NP40 was then added to a final concentration of 1 % for 
20 min while continuously stirring the cell suspension. Then MgCU was added to 
50 mM followed by DNAse at a concentration of 1 mg/1 and incubation was 
continued at room temperature for 20 minutes. The cell lysate was then cemrifuged 
as in method 1 at 26,0(X)xg in a continuous centrifuge and the supernatant was 
collected and passed through the continuous centrifuge a second time at 26,000xg. 
The supernatant resulting from this second centrifugation contained <5% of the 
total Slpni, but what was there could be recovered with NH4S04as described below. 
The pellets resulting from the 1st and 2nd 26,000xg centrifugations were: combined 
and extracted with LiBr as described below. 

Method 3. For this method, a strain of £. coli was used that contains a 
second plasmid which encodes the T7 phage lysozyme. This plasmid is compatible 
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with the plasmid encoding the SlpIII gene and the drug resistance deteiminant. The 
strain was grown in the same medium and under the same conditions ais in the first 
two methods. However, due to the production of the T7 lysozyme insiide the cells, 
their cell wall was weakened and they could be easily lysed at the completion of the 
5 fermentation by the addition of EDTA to > 100 mM and NP40 to a concentration 
of from 0.5 to 1 .0% v/v. Lysis could also be achieved by the addition of chloroform 
(20 ml per liter) of fermentation broth instead of NP40. Alternatively, cells could 
be collected by centrifugation prior to lysis, resuspended to 1 kg wet weight in 6 L 
in Tris-EDTA as described in the first two methods and then lysed by ihe addition of 
10 NP40 or chloroform. Following cell lysis by either method the lysate was 

centrifuged in a continuous rotor at 26,000xg as described in the first two methods. 
As with those methods, LiBr extraction of the pellet and NH4SO4 precipitation of the 
supernatant was used to recover the product. 



15 Purification of SlpIII 

The pellet obtained by centrifugation of the cell lysate at 25,00i3xg as 
described above was extracted with an equal volume of 9 M LiBr. Thii salt solution 
was added and the pellet was evenly suspended by stirring at room temperature 
(RT). The mixture was stirred for 1 hour at RT after an even suspension was 

20 obtained. The mixmre was then centrifuged at 26,000xg in a continuous rotor at 
40C or at RT to generate a pellet and a supernatant fraction. The supernatant was 
saved and the pellet was re-extracted with another equal volume of 9 M LiBr as 
above. After mixing for 1 hour the mixture was centrifuged at 26,000jcg and the 
supernatant from this centrifugation was combined with the supernatant from the 

25 first LiBr extraction and allowed to stand at 4oC overnight. ApproxinKitely 90% of 
the SlpIII contained in the cell lysate 26,000xg pellet was extracted by LiBr using 
this procedure. 

After the LiBr extract stands overnight at 4oC a precipitate formed, was 
removed by centrifugation at 26,000xg and was discarded. The supernatant was 
30 then placed in dialysis bags and dialyzed against several changes of dU^O for 2 days. 
As the LiBr was removed by dialysis the SlpIII product precipitated in the dialysis 
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bags. The precipitate was collected by centrifugation and washed 2-3 times with 
dHjO. The final washed product was centriftiged and dried by lyophilizauon. 

For the recovery of SlpIII from the 26,000xg supernatant fractions, NH4SO4 
precipitation was used. Solid NH4SO4 was slowly added to the sample which was 

5 maintained at 4oC, until 38% saturation was achieved (231 g/1). The mixture was 
then stirred at 4"C for 2-3 hours. The precipitate was recovered by centrifugation in 
a continuous flow centrifuge and washed 4-5 times with an equal volume of distilled 
H-,0 or with 0.5% SDS in HjO. After each wash the precipitate was recovered by 
continuous centtifugation. The pellet became increasingly white with successive 

10 washes as contaminating protein was removed. SlpIII was recovered as a washed 
pellet and was dried by lyophilization. 

Trypsin Treatment 5;tep nf SloIII 

SlpIII was suspended in 50 mM Tris HCl, pH 8.0, 0, 1 M NaCl buffer, and 
15 was placed in a 37°C water bath, and TPCK Ureated trypsin solution was mixed into 
the suspension. The final trypsin concentration was 0. 1 %. After 3 hours, the 
solution was centriftiged at 16,000xg for 15 min, the pellet was washed with a half 
equal volume of 0.5% SDS in H^O first, then with distilled water. After each wash 
the pellet was recovered by centrifugation. The final product was resuspended in 
20 water and kept at 4^*0 for further analysis. 

With the trypsin treatment, SlpIII was purified to 99.4% purity. 

Phvsical Measurements of SlpIII 

Physical measurements of the purified silk-like proteins have been compared 

25 with those of Bombyx mori silk in order to establish that the repetitive amino acid 
polymers produced microbiologically accurately mimic the properties of naturally 
occurring polymers. Physical measurements were performed to confirm the model 
of anti-parallel chain pleated sheet conformation for the crystalline regions of 
Bombyx mori silk fibroin (Marsh, Corey and Pauling, Biochem. Biophys, Acta 

30 (1955) 16; Pauling and Corey, Proc, NatL Acad, Sci. USA (1953) 39:247). 
Preliminary analysis of x-ray diffraction patterns obtained from Sip films are 
consistent with those described by Fraser, MacRai, and Steward (1966) (Table 4). 
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Circular Dichroic (CD) and Fourier transform infrared (FOR) spectroscopic 
analysis of Slpin were consistent with a high degree of extended P and p-tum 
conformations. Comparisons of the spectra obtained from SlpIII with that of 
naturally occurring silk fibroin in various solvents (Isuka and Young, Proc. NatL 
5 Acad, ScL USA (1966) 55: 1 175) indicated that SlpIII in solution consists of a 
mixture of the random and highly ordered structures seen in silk fibroins. 



Tahle 4 



Material 

10 (AG)„ 

(AGAGSG)„ 
CTP fraction 
Native fibroin 



15 SlpIII 

Referenced in Fraser ei al., 7. Mol. Biol. (1966) 19:580 



aiA) 






9.42 


6.95 


8.87 


9.39 


6.85 


9.05 


9.38 


6.87 


9.13 


9.40 


6.97 


9.20 


9.44 


6.95 


9.30 


9.38 


6.94 


8.97 



Example 4 

20 FBSl Gene C nnttmction: 

Six oligonucleotide strands were synthesized and purified as described 

previously. 



(HIII) Banll StuI 
i. q' AnPTQGGCTC TGGAGTAGGCCTG3' 

li. 5'£AIICAfifi££IACTCCAS5a£CCS:3' 
(ERl) StuI Banll 

(HIII) BanI 

iii . ' ACiPTTQGTGCCA GGTGTAGGAGTTCCGGGTGTAGGCGTTCCGGGAGTTGG 
TGTACCTGGAGTGGGTGTrCCAGGCGTAGGTGTGC3 ' 

(Xmal) 

i V . 5 'CCfiSfiCACACCTACGCCTGGAACACCCACTCCAGGTACACCAACrCCCG GA 
ACGCCTACACCCGGAACTCCTACACCTSG£fl£CA3 ' 

BanI 



40 



25 



30 



35 
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(Xmal) Ahall 
V . 5 'CCfiSfiGTAGGAGTTACCAGGGGTASSSSaiCCCTGGAGCGGGTGCTGGTAG 
CGGCGCAGGCGCGGGCTCCGGAGTAG GGGTGCC G5' 

Banll Bani 
(ERI) Bani Banli 

vi. S' AATTCGGgAgg gOTACTCCGGAGCggQmPCTGC'GCCGCTACrAGgACCCG 
CTCCAGGSaCfiCCTACCCCTGGTACTCCTACCB ' 
Ahall 

Oligonucleotide strands (iii), (iv), (v) and (vi) were annealed and ligated 
with the DNA of plasmid pBSnil3(+) (Stratagene) which had been digested with 
Hindlll and EcoRI. The products of this ligation reaction were iransfonned into £. 
coli strain JM109. Transfonnant colonies were selected for resistance ic* ampicillin. 
Colonies were screened for their hybridization with "P-Iabelled oligonucleotides 
(iii), (v). Plasmid DNA from several positively hybridizing clones was purified and 
sequenced. Two of the plasmids, pSY1292 and pSY1293. contained the sequence 
shown for oligonucleotides (iii), (v) and (iv), (vi). These sequences contained all of 
the nucleotides present in these synthetic oligonucleotides except one. A. G:C 
basepair was missing at position 7 (iii). The lack of this basepair obstructed one of 
the Bani sites. In order to introduce a second Banll site at the 5' end of the gene 
fragment, oligonucleotides (i) and (ii) were annealed and ligated with plasmid 
pBSml3(-f ) which had been digested with Hindlll and EcoRI. Plasmid DNA from 
the transformant colonies resistant to ampicillin was purified. Two plasmids, 
pSY1295 and pSY1296, which were digestible with StuI, a unique site contained in 
the oligonucleotide sequence, were sequenced. They were both shown to contain the 
sequence shown for oligonucleotides (i) and (ii). Plasmid DNA From pSY1292 was 
digested sequentially with Hindlll, SI nuclease, and EcoRI. The digestion products 
were separated by electrophoresis in an agarose gel and the DNA fragment of 
approximately 150 basepairs was excised from the gel. This DNA fragment was 
ligated with plasmid DNA pSY1296 which had been digested with StuI and EcoRI. 
The products of this ligation reaction were transformed into £. coli strain JM109 and 
were selected for resistance to ampicillin. Colonies were screened for hybridization 
to "P- labelled oligonucleotide (v). The plasmid DNA from two positively 
hybridizing clones was purified and sequenced. These plasmids were named 
pSY1297 and pSY1298. They contained the following sequence: 



62 



wo 98/10063 



1PCT/US96/1S306 



(Hindlll) Banll 

AGf^nf^GOTCT GGAGTAGCTGTGCCAGGTGTAGGAGTrCCGGGTGTAGGCGTlXrCGGGAG 6 0 
TCGACCCGAGACCTCATCCACACGGTCCACATCCTCAAGGCCCACATCCGCA/lGGCCCTC 

Xmal 

TTnriTCTAPrrGGAGTCGRTfrrTggAGGCGTARgTGTG CggGGGG TAGGAGTjLCCAGGGG 120 
AACCACATGGACCTCACCCACAAGGTCCGCATCCACACGGGCCCCATCCTCArGGTCCCC 



Banll 

1 0 TAGGCGTCCCTGGAGCGGGTGCTGGTAGCGGCGCAGGCGCSSfiClCCGGAGTilGGGGTGC 180 

ATCCGCAGGGACCTCGCCCACGACCATCGCCGCGTCCGCGCCCGAGGCCTCAirCCCCACG 

ECORI 

15 GCTTAAG 



FRST Multimer Gene Assembly: 

The BanI acceptor plasmid pSY937 was modified in order to ai:cept Banll 
terminal cohesive DNA fragments. Two oligonucleotides were synthesized for this 
20 purpose. 

(BatnHl) Dral Sspl Nrul Banll 

vii . S' GATCCT ATG TTTAAATATT CTCGCGAACGTTTTTCTATGGGCTCGATGTGr 

TACCGIfiCfiCaTGGATATCAGCTG3 ' 
Fspl EcoRV PvuII 

25 

(BamHI) PvuIIEcoRV Fspl Banll 

Vi i i . ';^ GATPrAGrTGATAC C!ATGCGCAGGGTAACACATCGAGCCCATACAAAAA 
rGTTCGCGAGAATATTTAAACATAG3 ' 

Nrul Sspl Dral 

30 

Oligonucleotides (vii) and (viii) were annealed and ligated with plasmid 
DNA pSY937 which was digested with BamHI. The products of this ligation were 
transformed into E. coli strain JM109 and colonies were selected for nssistance to 
chloramphenicol. Transformant colonies were screened by hybridization to "P- 

35 labelled oligonucleotide (vii). Plasmid DNA from two positively hybridizing 

clones, pSY1299 and pSY1300, contained the sequence shown for oligonucleotides 
(vii) and (viii), as determined by DNA sequencing. 

Plasmid DNA pSY1298 was digested with Banll and the digestion fragments 
separated by agarose gel electrophoresis. The EBSI gene fragment, approximately 

40 150 base pairs, was excised and purified by electro-elution and ethanol precipitation. 
Approximately 1 /ig of purified fragment was self-ligaied in order to produce 
multimers ranging in size from 450 bp to 6,000 bp. The products of tlie self- 
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ligation were then iigated with plasmid DNA pSY1299 which had been digested with 
Banll. The products of this ligation reaction were transformed into £. coli strain 
HBlOl . Transformants were selected for resistance to chloramphenicol. Plasmid 
DNA from individual transformants was purified and analyzed for increased size due 
5 to EBSI multimer DNA insertions. Ten clones (pSY 1240- 1249) with insens ranging 
in size from 1.5 Kbp to 4.4 Kbp were obtained. 

Rxrm^rif^!^ nf FRST Multimer Gene: 

One of these clones, pSY1248, which contained a 4 Kb EBSI multimer gene 

10 was recloned in the XP^ expression vector, pSY751, Plasmid DNA from pSY1248 
was digested with Nrul and PvuII, separated by agarose gel electrophoresis, and the 
DNA band corresponding to the EBSI multimer gene was excised and purified by 
NACS purification. DNA from plasmid pSY751 was digested with PvuO and 
Iigated with the NruI-PvuII fragment from pSY1248. The products of this ligation 

15 were transformed into E, coli HBlOl, and the transformants selected for resistance 
to ampicillin. Two clones were isolated containing the new plasmid pSY1280. £. 
coli cells containing pSY1280 were grown at 30oC to an ODgoo of 0.7 and then 
shifted to 42oC for 1.5 hours. The proteins produced by these cells was analyzed by 
SDS-PAGE. The separated proteins were transferred to nitrocellulose paper and 

20 detected by immunoreactivity with anti-ELP rabbit serum. A strongly reactive 
protein band was observed with an apparent molecular weight of 120 kE>aL 

The Ampicillin drug resistance gene of pSY1280 was substimted with the 
Kanamycin marker and the subsequent plasmid was called pSY1332. Tlus plasmid 
was used in fermentation for the purification of EBSI (see Methods). 

25 

pSyi332/pSY1280 EBSI Protein 1413 AA MW 113,159 

MDP WLQRRDWENPGVTQLNRLAAHP P FAS ERFCMGS 
[ (GVGVP) , (GAGAGSGAGAGS) , J „ 
30 MCYRAHGYQLSAGRYHYQLVWCQK 
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piirifiraTi^n FBSI Protein: 

£. coli strain HBlOl containing plasmid pSY1280 was fermented in 10 L 
volume. The cells were concentrated by filtration and further harvested by 
centrifugation. Pelleted cells were stored frozen at -70oC until processed. Frozen 
5 cells were thawed on ice and suspended in 4 ml of 50 mM Tris-HCI pH 7.0, 10 mM 
EDTA. 5 mM PMSF per gram wet weight of cells. The cells were bn)ken by 
French pressing twice at 15,000 psi and then cooled to OoC. The cmds lysate was 
cleared by centrifugation at 26,000xg for 20 minutes. The supernatant proteins were 
precipitated by addition of solid ammonium sulfate to 20% of saturation (114 g/1). 

10 The precipitate was collected by centrifugation at 10,000xg for 10 minites. The 
pellet was resuspended in 10 ml ofUJd and dialyzed against 10 mM Ti is pH 8.0, 
0. 15M NaCl at 4oC. The dialyzed solution was digested with 0. 1 % Tiypsin 
(Sigma) for 1.5 hours at room temperamre, and reprecipitated with 20% ammonium 
sulfate. The precipitated protein was resuspended in HjO and dialyzed against 10 

15 mM Tris pH 7.0, 1 mM EDTA at 4oC. The protein purity of this sample was 
analyzed by amino acid composition and determined to be 83%. 

Elastic Prnperties of EBSI Protein: 

The soluble preparation of semi-purified EBSI protein described above was 

20 incubated at 37oC for 30 minutes and centrifuged at 10,000xg for 10 nin at room 
temperamre. This treatment caused the EBSI protein to aggregate, become 
insoluble, and pellet into a translucent solid. The solid was resistant tc» mechanical 
disruption either by vortexing or by maceration using a glass rod. The solid could 
be cut with a razor blade into strips which exhibited a high degree of elasticity. 

25 These strips fully retained their shape after repeated extensions and reliixations. 
They resisted compression with no apparent irreversible defomation of stricture. 

EBSI Purification 

EBSI sample C70% pure) was dialyzed in 50 mM Tris HCl, 50 mM NaCl, 
30 pH 8.0 at 4*C overnight with one change of buffer. If precipitation was observed, 
the sample was centrifuged at 27,000xg for 15 min at 4oC. All remaining steps 
were performed at 4oC. The supernatant was applied to a DEAE-Sephacel column 
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which had been equilibrated with 50 mM Tris HCl, 50 mM NaCl, pH 8.0. The 
flow through fractions which contained EBSI were collected and pooled. NaCl was 
added to the pooled fractions from DEAE-Sephacel column to make a fin;il 
concentration of 2 M NaCl in the sample. Insoluble material was removed by 

5 centrifiigation at 27,000xg for 20 minutes. The supernatant was then loaded onto 
Phenyl-Sepharose column which was equilibrated with 50 mM sodium phosphate 
buffer, pH 7.0, with 2 M NaCl. The colunm was washed extensively with buffer 
until no eluting protein was detected by Ajgo- The column was then eluted stepwise 
with 50 mM sodium phosphate buffer, pH 7.0 and finally with water. Thie EBSI 

10 active fractions were pooled and stored at 4**C for further analysis. 

With the addition of these steps to the previous procedures, 100% pure EBSI 
was obtained. 



Example 5 

15 ELPT Constn ictinn and Expression 

Two oligonucleotide strands were synthesized and purified as desc:ribed in 

the Methods section. 

(EcoRI) BanI Smal 
i ) 5 ' AATTCGGTGCC CGGTGTAGGAGTTCCGGGTGTAGGCGTT CCCGGGG TAG 

20 GCGrrCCGGGAGTAGGSfiiaCCA3 ' 

BanI 

BanI Smal 
i i ) 3'G ggACGGG CCACATCCTCAAGGCCCACATCCGCCASGG££CCATCCGCA 
25 AGGCCCTCATCCCCaCSfillCfi&S ' 

BanI (Hindlll) 



The two oligonucleotide strands were annealed and ligated with thie DNA of 
plasmid pBS ml3(+) (Stratagene) which had been digested with RENs Hindlll and 
30 EcoRI. 

The products of this ligation reaction were transformed into £. coli strain 
JM109. Transformant colonies were screened for their hybridization witli ^^P- 
labeled oligonucleotide (i). Plasmid DNA from positively hybridizing clones was 
purified and sequenced. One plasmid, pSY1287, contained the sequence shown for 
35 oligonucleotides (i) and (ii). 
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Plasmid DNA from pSYl287 was digested with BanI REN and the digestion 
fragments were separated by agarose gel electrophoresis. The ELPI geine fragment, 
approximately 60 bp. was excised and purified by NACS column. AppToximately 1 
fig of purified fragment was self-ligated in order to produce multimers ranging in 

5 size from 300 bp to 5000 bp. 

The products of the self-Iigation were then ligated with plasmid DNA 
pSY937 which had been digested with REN Banl. The product of this ligation 
reaction was transformed into £. coli strain HBIOI. Transformants were selected 
for resistance to chloiamphenicol. Plasmid DNA from individual transformants was 

10 purified and analyzed for increased size due to ELPI multiple DNA insertions. Four 
clones (pSY1388-1391) with inserts ranging in size from 1.0 kbp to 2.:5 kbp were 
obtained. These clones were recloned in the APR expression vector pJ»Y751 . The 
clones obtained (pSY1392-1395) were used for expression of ELPI. 

15 The ELPI protein had the following amino acid composition: 

PSY1395 ELPI Protein 859 AA MW 72,555 

MDPWLORRDWENPGVTQLHRLAAHPPFARNILAIRW 
[ (VPGVG) ,1 ., VPWTRVDLSAGRYHYQLVWCQK 

20 fiFT PI Gene rnn«!tnictinn and Expression 

Two oligonucleotide strands were synthesized and purified as described in 

the Methods section. 

Fspl Pvull SnaBI (PstI) 

(i) ' - nTGCGCA BCTGGI&CgraGCISga- 3' 

25 

(PstI) PV"II 

(ii) 3'.acsi£acfi£fiicGAccaiscaicG-5' 

FspI SnaBI 

30 These oligonucleotide strands were annealed and ligated with jslasmid 

pSY1304 which had been digested with Psfl REN (pSY1304 differs from pSY857 by 
having a monomeric unit in place of the trimeric unit of pSY857). Plasmid DNA 
from transformant colonies resistant to chloramphenicol was purified. One plasmid, 
pSY1365, which was digestible with REN SnaBL was sequenced and proven to be 

35 correct. 
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ELPI gene fragment purified as described (ELPI construction and 
expression) was treated with Mung Bean Nuclease as described by supplii^r 
(Stratagene). The DNA fragments mixture was then ligated with plasmici DNA 
pSY1365 which had been digested sequentially with RENs Fspl, SnaBl and calf 

5 intestinal phosphatase. The products of this ligation reaction were transformed into 
E. coli strain HBlOl and were selected for resistance to chloramphenicol. Plasmid 
DNA from individual transformants was purified and analyzed for the ELPI 
monomer DNA insertion. Two plasmids, pSYl366 A and B, were sequenced. 
They were both shown to contain the ELPI DNA sequence in the correct orientation. 

10 Plasmid DNA pSY1365 was digested with REN BanI and the DN A fragment 

containing the SELPl monomer was gel purified. To create multimers, I ^g of the 
SELPl DNA fragment was self-ligated. Multimers were obtained ranging in size 
from 500 bp to 10 kbp. The SELPl multimers were cloned into the BanI site of 
pSY1262. 

15 Positive clones were characterized by gel electrophoresis for the 2;ize of the 

insened multimer and used for expression and protein analysis. 



pSY1396 SELPl Protein 2025 AA MW 148,212 

20 MDPWLORRDWENPGVTQLNRIAAHPPFASDPMGAGS ( GAGAGS ) 4 

[GAA (VPGVG) , VAAGY (GAGAGS) ,] 

GAA ( VPGVG ) 4 VAAGY ( GAGAGS ) j G AGAMDPGRYQLSAGRTHYQLVWCOK 



SELP2 - Monomer Construction 
25 Plasmid DNA pSY1298 was digested with Banll REN and the EBSI gene 

fragment was purified as described previously. The EBSI monomer fragment was 
ligated into pSY1304 (pSY937 containing a monomer of SlpIII, constructed as 
pSY857) which had been digested with Banll REN and treated with calf intestinal 
phosphatase). 

30 The products of the ligation mixture were transformed in £. coli strain 

HBlOl . Transformants were selected for resistance to chloramphenicol. After 
restriction analysis of several isolates, one plasmid was chosen. pSY130;i , 
containing a DNA fragment corresponding to the SELP2 monomer gene. 
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"^^T^f? - Mv'^T'^ ^^"^ Assemhlv anr^ Fypression 

Plasmid DNA pSY1301 was digested with REN BanI and the DN A fragment 
containing the SELP2 "monomer" was gel purified. To create multimers, 1 fig of 
the SELP2 DNA fragment was self-ligated. Multimers were obtained greater than 
5 12 kb in size. 

The SELP2 multimers were cloned into the BanI site of pSY1262. Positive 
clones were characterized by gel electrophoresis for the size of the insert(;d 
muliimer. The clones with inserts ranging in size from 1.5 kb to 11 kb were 
selected. Plasmid DNA pSY1372 containing an insert of 6 kb (18 repeats) was used 
10 for further analysis and protein purification. 

^J^;\ f7 . Protein Purification 

£. coli strain HBlOl containing plasmid pSY1372 was fermented according 
to the procedure described in Methods for fermentation. The cells were harvested 

15 by ccntrifugation. Pelleted cells were stored frozen at •70'*C until processed. 

Frozen cells were thawed on ice and suspended in 4 ml of 50 mM TrisHCl, pH 7.0, 
10 mM EDTA, 5 mM PMSF per gram wet weight of cells. The cells were broken 
by passing through a Gaulin cell disrupter at 8,000 psi. The crude lysate was 
cleared by centriftigation at 26,000xg for 20 minutes. The supernatant, which 

20 contained > 75 % of the SELP2 protein, was precipitated by addition of 20% 
ammonium sulfate (1 14 g/L). The precipitate was collected by centriftigation at 
10,000xg for 10 minutes. The pellet was resuspended in 10 ml of HjO aiid dialyzed 
against 10 mM Tris pH 8.0, 0.15 M NaCl at 4oC. The dialyzed material was 
centrifiiged at 26,000xg for 15 min in order to collect the insoluble fraction of 

25 protein which contained approximately 10% of the SELP2 protein. This insoluble 
protein pellet was washed twice in 0.2% SDS at 50oC for 30 min with occasional 
shaking. The insoluble protein was collected each time by ceniriftigatiom at 
26.000xg for 15 min followed by a wash of 50% ethanol. The final prorein pellet 
was resuspended in water and analyzed by Western blot analysis and amino acid 

30 composition. By Western blot the SELP2 protein appears to be homogeneous in size 
consistent with its large molecular weight {> 150 kDal). By amino acid composition 
the SELP2 preparation is approximately 80% pure and the observed molar ratio of 
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<;rP4 - Constmction an^ Frpression 

Plasmid DNA from pSY1304 was partially digested with REN Haell and the 
digestion fragments separated by agarose gel electrophoresis. The larger DNA 
fragments were excised and purified by NACS colunm. The purified fragments 

5 were selMigaied, the ligation reaction was heated at 70oC for 15 min to inactivate 
the T4 DNA Hgase and evenmally digested with REN Psd. The digestion mixnire 
was then transformed into £. coli strain JM109. Transformants were selected for 
resistance to chloramphenicol. Plasmid DNA from individual transformants was 
purified and analyzed for: (1) resistance to REN PstI; and (2) deletion of 60 bp 

10 Haell fragment contained within the SELP2 gene fragment. One clone (pSY1378) 
satisfied both requirements. Plasmid DNA pSY1378 was digested with I^IEN BanI 
and the DNA fragment containing the SLP4 monomer was gel purified. To create 
multimers, 1 ^g of SLP4 DNA was self-ligated. Multimers were obtaimsd ranging 
in size from 300 bp to 6 kbp. The SLP4 multimers were cloned into the BanI site of 

15 pSY1262. Positive clones were characterized by gel electrophoresis for the size of 
the inserted multimer and used for expression and protein analysis. 

pSyi398 SLP4 Protein llOX AA MW 76,231 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS [ { GAGAGS ) 
20 ( GAGAGS ) 4 GAGAMDPGRYOLS AGRYHYQLVWCQK 

^'^P-f^y rFin f-^T ^^ ConsTniction and Rxpression: 

The SLPIII polymer was chosen as a backbone structure for insertion of a 
biologically functional sequence because of its predicted stnicnire, alloviing for 

25 fabrication of useful products; having good strucmral properties for use in a wide 
variety of applications; having B-nim structures between interactive strands; and 
allowing for substinition of the turn sequences with other sequences. Tlie 
fibronectin cell-binding domain, amino acids 1405-1512, has a strong tiim 
propensity, with the tripeptide RGD providing for cell attachment, predicted to be 

30 present within a hydrophilic loop between adjacent B-strands. A 10 amino acid 
sequence spanning this proposed loop stnicmre (referred to as fibronectiin cell- 
binding or FCB sequence) was chosen to constimte the functional block of amino 
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acids to be inserted within the SLPIII backbone. The insertion site within the 
SLPIII backbone was chosen to correspond with the amino-acid sequence GAAGY 
which is also predicted to provide a turn structure (Chou and Passman, 
Biochemestry, 13:222-244 (1974)). The design allows for conservation of the FCB 
5 structure while causing minimal disruption of the SLPIII (GAGAGS), B-strand 
crystal-packing domains. 

The SLPIII gene monomer contains a PstI restriction endonuclease site 
within the sequence encoding the proposed turn stnicmre, GAAGY. This; site was 
used to insert the synthetic DNA encoding the 10 amino acids of the FCB sequence. 
10 Two complementary DNA strands comprising the FCB site, 36 bases in length, were 
synthesized consisting of the sequence shown below: 

5' - GTGACTGGCCGTGGTGATAGCCCGGCTAGCGCTGCA -3' 

3' - ACGTCACTGACCGGCACCACTATCGGGCCGATCGCG -!>' 

15 These oligonucleotides were purified according to the procedures described 

in Example 1, and cloned into the PstI site of pSY1304. pSY1304 DNA was 
digested with PstI and ligaied with a mixmre of the FCB oligonucleotides. The 
ligation reaction products were transformed into E. coli cells. Colonies containing 
the plasmid were selected on bacterial culture plates containing the antibiotic 

20 chloramphenicol. Individual colonies were grown and plasmid DNA purified and 
analyzed for the presence of the FtB oligonucleotide sequence by restriction 
digestion with Nhel. Plasmids containing this restriction site were subjected to 
DNA sequencing and two candidates were shown to be correct. The partial 
nucleotide sequence of one of these, pSY1325, and the encoded amino-acid sequence 

25 were as follows: 



Ban I 





GGT 
G 


GCC 
A 


GGC 
G 


AGC 
S 


GGT 
G 


GCA 
A 


GGA 
G 


GCC 
A 


GGT 
G 


TCT 
S 


GGA 
G 


GCT 
A 


GGC 
G 


30 


GCG 
A 


GGC 
G 


TCT 
S 


GGC 
G 


GCG 
A 


GGC 
G 


GCA 
A 


Bam 
GGA 
G 


HI 

TCC 
S 


GGC 
G 


GGA 
A 


GGC 
G 


Gcr 

A 


35 


GGT 
G 


TCT 
S 


GGC 
G 


GCA 
A 


GGG 
G 


GCA 
A 


GGC 
G 


TCT 
S 


GGC 
G 


GCA 
A 


GGA 
G 


GCG 
A 


GGG 
G 
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Pst I 

TCT GGA GCT GCA GTG ACT GGC CGT GGT GAT AGC CCG GCT 
SGAAVTGRGDSPA 

5 Pst I 

AGC GCT GCA GGC TAT GGA GCT GGC GCT GGC TCA GGT GCT 
SAAGYGAGAGSGA 

Ban I 

10 GGA GCA GGA AGC GGA GCG GGT GCC 
G A G S G A G 

The FCB-SLP monomer gene fragment was purified from pSY1325 by 
digestion with BanI, agarose-gel electrophoresis, and NAGS purification (Example 

15 1). The monomer gene fragment was selMigated and cloned into pSY937 which 
had been digested with Banl. The products of this ligation were tranjiformed into E. 
coll and selected for growth on chloramphenicol. Plasmid DNA from individual 
colonies was analyzed for inserts containing multiple FCB-SLP monomer fragments 
by digestion with Nrul and EcoRV and electrophoresis on agarose gels. One clone 

20 was identified containing two inserts, one of approximately 2. 1 kb and the other of 
2.8 kb. Both inserts were cloned individually and transferred to the expression 
vector pSY751. Plasmid pSY1325 was digested with Nrul and Pvull and the 2.1 
and 2.8 kb insert bands were purified. These DNA fragments were ligated with 
pSY751 that had been digested with PvuII. The products of this reaction were 

25 transformed into E. coli and selected for growth on the antibiotic amjjicillin. 

Plasmid DNA from individual colonies was analyzed by restriction digestion for the 
presence of the FCB-SLP polymer gene. Two clones were identified, pSY1520 and 
1521. containing the 2.1 and the 2.8 kb inserts, respectively. 

£, coli cells containing pSY1520 and pSY1521 were grown aiL 30X in LB 

30 medium containing 50 ^g/ml ampicillin to an OD^ooOf 0.7. Production of the FCB- 
SLP polymer proteins were induced by increasing the culture temperature to 42°C 
for 1.5 hours. The cells were harvested by centriftigation and lysed in sample buffer 
containing sodium dodecylsulfate (SDS) and P-mercaptoethanol by heating at 100*C 
for 5 minutes. Samples of these lysates corresponding to 5 x 10^ cells were applied 

35 10 an 8% polyacrylamide gel containing SDS, electrophoresed, and transferred to 
nitrocellulose filters by eleciroblotting. The filters were incubated ei ther with anti- 
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SLP or anti-FCB peptide antibody. Specific immunoreactivity with the anti-SLP 
antibody was observed for a protein band of approximately 75 kd in lysates of 
pSY1520, 95 kd in lysates of pSY1521, and 120 kd in lysates of the SLFIII clone 
pSYl 186. Reactivity with the anti-FCB antibody was observed only for the two 
5 FCB-SLP polymer bands. 



pSYlS20 FCB-SLPIII 767 AA MW 57,467 

( f M) DPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
GAGS (GAGAGS) tGAAVTGRGDSPASAAGY 
10 C (GAGAGS) ,GAAVTGRGDSPASAAGy] , 

GAGAGSGAGAGSGAGAMDPGRYQLSAGRYHYOLVWCQK 



?\i\^m\<\ r^ronA ronstmction: 

Two oligonucleotide strands containing multiple cloning sites (M CS) were 
15 synthesized and purified as described in Example 1 . 

Fokl Fokl Seal 

O.A) 5* - GTGCTGCGGATGCTCGAGATGGTGCATGCATGTACATCCGAGTACTTCGAT 

0 , B ) 3 ' - ACGCCTACGAGCTCTACCACGTACGTACATGTAGGCTCATGiVAGCTA 

20 

After annealing, the two oligonucleotide strands were ligaied wiih pSY937 
which had been digested with BanI and EcoRV RENs. The product of i:he ligation 
mixmre was transformed into £. coli and selected on bacterial plates containing the 
antibiotic chloramphenicol. Plasmid DNA from individual colonies wa:> analyzed on 

25 agarose gel electrophoresis after digestion with Seal and StuI RENs. One plasmid, 
pPT0124, contained the expected DNA fragment. 

The new MCS were then moved to plasmid pSY1367. This plasmid is a 
derivative of pSY1299, which was digested with Neil REN and the larfje DNA 
fragment was purified by agarose gel electrophoresis and NACS purification. The 

30 purified DNA fragment was treated with DNA Polymerase (Example I K ligated, 
then digested with Fokl prior to transformation in £. coli strain HBlOl . Plasmid 
DNA from single colonies was purified and analyzed by restriction digests. One 
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plasmid, pSY1366. was found lo be correct and lacking the only Fokl site present in 
pSY1299. 

Two oligonucleotide strands were synthesized and purified as described in 
Example 1: 

5 (Banll) Fokl 

1 .A) 5'- CTACATGTGTTACACATCCCGTGC 

1 . B ) 3 ' - CCGAGATGTACACAATGTGTAGGGCACG 

Oligonucleotide strands l.A and l.B were annealed and ligateti with the 
10 DNA of plasmid pSY1366 which had been digested with Banll and Fupl RENs. The 
products of this ligation reaction were transformed into £. coli strain HBlOl . 
Plasmid DNA from transformed colonies was purified and digested with Fokl. 
Clones which linearized with Fokl were sequenced. Plasmid pSY1367 conuined the 
desired MCS sequence and was chosen for subsequent constructions. 
15 Plasmids pPT0124 and pSY1367 were digested with Nrul and Ncol and the 

DNA fragments were purified by agarose gel electrophoresis and NACS 
purification. The small fragment (approximately 500 bp) from pPT0:i24 was ligated 
wiih the large fragment from pSY1367. The product of the ligation rtiixnire was 
transformed into £. coli. Plasmid DNA from single colonies was purified and 
20 analyzed by restriction digests and DNA sequencing. One plasmid, pPT0134, 

contained the desired sequence and was used as the acceptor vector fc-r further DNA 
constructions. 

<;fj pf rnnstniction and Expression: 

25 Plasmid DNA pSY1521 was digested with BanI REN and the SLPF (FCB- 

SlpIII) monomer was purified using NACS colunm (see Example 1). The DNA 
fragment was ligated with pPT0I34 previously digested with Fokl REN, treated 
with calf intestinal phosphatase (see Example 1), and subsequently purified using 
NACS column. The product of this ligation reaction was transformed into £. coli 

30 strain HBIOI. Transformants were selected for resistance to chloranriphenicol. 
Plasmid DNA from transformed colonies was purified and digested v/ith Fokl. 
Clones with the correct restriction pattern were sequenced. Plasmid pPT0141 
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contained the desired SLPF monomer sequence and was chosen for subsequent 
constructions. 

Plasmid pSY1377 was digested with BanI REN and the SELP3 gene 
monomer DNA fragment was purified by agarose gel electrophoresis followed by 

5 NACS column. The purified SELP3 gene monomer, 268 bp, was ligated with 
plasmid DNA pPT0l41 previously digested with BanI REN and purified using 
NACS column. The product of this ligation reaction was transformed into £. coli 
strain HBIOL Transformants were selected for resistance to chloramphctnicol. 
Plasmid DNA from transformed colonies was purified and digegsted with Fokl. 

10 Clones with the correct restriction pattern were sequenced. Plasmid pPT0146 
contained the desired SELPF monomer DNA. 

Plasmid DNA from pPT0146 was digested with Fokl REN and ttie digestion 
fragments were separated by agarose gel electrophoresis. The SELPF gime 
fragment, 477 bp, was excised and purified by NACS column (see Example I). The 

15 purified fragment was ligated with plasmid pSY1262 which had been dijjested with 
REN BanI. The product of this ligation reaction was transformed into E. coli strain 
HBIOI. Transformants were selected for resistance to kanamycin. Plasmid DNA 
from individual transformants was purified and analyzed for increased size due to 
SELPF multiple DNA insertion. Several clones were obtained ranging in size from 

20 1 kbp to 6 kbp. One clone pPT0183, with an insert of approximately 2.9 kbp was 
chosen for expression and protein analysis. 

E. coli strain HBlOl containing plasmid pPT0183 was grown as described in 
Example 1 . The protein produced by these cells was analyzed by SDS-PAGE for 
detection of reactivity to SLP and ELP antibodies. In every analysis a strong 

25 reactive band was observed with an apparent molecular weight of approximately 100 
kD. 

PPT01B3 SELPF 1011 AA MW 79,597 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
30 GAGS ( G AGAGS ) ? ( GVGVP ) , 

C (GAGAGS) ijGAAVTGRGDSPASAAGY (GAGAGS) 5 (GVGVP) ,] ^ 
(GAGAGS ) ijGAAVTGRGDSPASAAGY (GAGAGS ) , 
GAGAMDPGRYQLSAGRyHYQLVWCQK 
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Example 6 

?\^m\<^ PPT0?S^ Constniction: 

Plasmid pACYC184 (Chang, A.Y.C. and Cohen, S.N., /. BacterioL, 
134:1141-1156 (1978)) was digested with BanI REN, purified by agarose gel 
5 electrophoresis, and the DNA fragment corresponding to approximately 2,000 bp 
was further purified using a NACS column. This DNA fragment was filled in using 
DNA polymerase (see Example 1) and then self-ligated. The products of the 
ligation mixture were transformed into E. coli strain HBlOl and selected on 
bacterial plates containing chloramphenicol at 30 ^tg/ml. Plasmid DNA from 
10 individual colonics was linearized by digestion widi Eco47III. One clone, pPT0235, 
was used as the acceptor vector for subsequent DNA manipulations. 

Two oligonucleotide strands were synthesized and purified as described in 
Example 1: 

15 (EC047III) Pmel Pmll Nrul BanI 

StuI EcoRV SnaBI(Snal) 

1.5'- GCTATGTTTAAACCACGTGTTCGCGATCCGGGTGCCGATCCAGGCCTGCGAT ATCAGTACGTA 
2.3'- CGATACAAATTTGGTGCACAAGCGCTAGGCCCACGGCTAGGTCCGGACGCTATAGTCATGCAT 
AMFKPRVRDPGADPGLRYQYV 

20 

The two oligonucleotide strands were annealed and ligated with the DNA of 
plasmid pPT0235 which had been digested with Eco47ni and Snal RE;Ns. The 
product of this ligation reaction was transformed into £. coli su-ain HKIOI. Plasmid 
DNA from transformants was purified and digested with EcoRI in combination with 
25 Eco47III or Snal or Nrul RENs. Plasmid DNA from two clones that gave the 
correct digestion pattern was sequenced. One plasmid, designated pPT0285, was 
found to be correct and chosen for further constructions. 

n,P^ 7 rnnstniction and Ext?ression: 
30 One oligonucleotide strand coding for the CLP 3.7 gene moncmer (see Table 

5) was synthesized using an Applied Biosysiems DNA synthesizer model 381A and a 
2000A synthesis column supplied by Glen Research. After the synthesis, the 226 
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base DNA fragment was deproiected and cleaved from the column support by 
treatment in NH4OH at 55** for 6 hours. 

Tablg 5 

5 s'-ATGGCAGCGJUU^GGGGACCGGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCAGGG 
GCTCCGGGTACTCCTlGGTCCGCAAGGCCrGCCGGGTTCy^CCGGGTGCTCCGGGAACT 
AGGGCTTGCCGGGATCCCCAGGTGCACCAGGAACGCCGGGACCrrCAGGGTCTTCCGGGl'AGCCCTGG 
TGCCTTTCCGCTAAAGTCCTGCCGT - 3 ' 

10 Two additional DNA strands were synthesized to be used as primers for PCR 

amplification. The synthesis and purification of these DNA primers was performed 
as described in Example 1. The two strands are: 

1. 5' - AAG AAG GAG ATA TCA TAT GGC AGC GAA AGG GGA CC -3* 

2. 5' - CGC AGA TOT TTA AAT TAC GGC AGG ACT TTA GCG GAA A -3' 

15 

The PCR reaction was performed as described in Example 1 . 
The DNA was resuspended and digested with BanI REN as described in 
Example 1 . The digested DNA was purified as described in Example 1 , and then 
ligated with pPT0285 previously digested with BanI, treated with SAP, and purified 
20 as described in Example 1 . The product of the ligation reaction was transformed 
into £. coU strain HBlOl . Plasmid DNA from tmsformants was purified and 
analyzed as described below. Colonies were picked and transferred onto a plate and 
into a 0.5 ml microfuge mbe containing 50 ^1 of lysis buffer (1 % Twe<m 20, 10 
Tris-HCl pH 8.0, 1 mM EDTA). The tube was closed, incubated at 9:>°C for 10 
25 minutes. And then cooled to room temperature. 5 fi\ of lysate was adiled to 45 fil 
MasterMix (Ix PCR buffer as described previously, 5 U Amplitaq, 20i3 /xM dNTPs) 
in a 0.5 ml Perkin Elmer thin-walled Gene Amp^^ reaction nibe. Amplification ws 
performed in a Perkin Elmer DNA Thermal cycler model 480 for 30 cycles with the 
following step cycle of 1 minute each: 95°C, 52*C, and 72**C. Aliquots from 
30 different reactions were analyzed by agarose gel electrophoresis using 1.5% Low 
Melting Point agarose in 0.5x TAE buffer. Plasmid DNA from the clones showing 
the correct size insert was purified and anlyzed by DNA sequencing. Plasmid 
pPTOSlO contained the desired CLP 3.7 monomer sequence (see Table 6). 
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BanI Aval/Smal 
5 ' - GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGC3GAAGCCCA 
3 ' - CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT 

GAPGTPGPQGLPGSP 

Banll Gsul StuI Dralll 

GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG 
CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC 
GAPGTPGPQGLPGSP 

BGII BAMHI 
GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA 
CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT 
GAPGTPGPQGLPGSP 

ECO0109I BanI 
GGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGGTGCC - 3 
CCACGTGGTCCTTGCGGCCCTGGAGTCCCAGAAGGCCCATCGGGACCACGG - 5' 
GAPGTPGPQGLPGSP (G A) 

CLP3.7 Pol yiner ronstmction: 

Plasmid DNA from pPT0310 was digested with BanI REN and the digestion 
fragments were separated by agarose gel electrophoresis. The CLP 3.7 gene 
fragment, 180 bp, was excised and purified by NACS column (see Methods). The 
purified fragment was ligated with plasmid pSY1262 which had been prepared as 
follows: pSY1262 plasmid DNA was digested with BanI REN and subiiequently 
treated with Shrimp Alkaline Phosphatase (SAP) as described in ExamiJle 1. 

The product of this ligation reaction was transformed into £. coli strain 
HBIOI. Transformants were selected for resistance to kanamycin. Plasmid DNA 
from individual transformants was purified and analyzed for increased size due to 
CLP 3.7 multiple DNA insertion. Several clones were obtained and t\/o of them 
containing inserts of approximately 1.25 kbp and 2.6 kbp (pPT0314 and pPT0312 
respectively) were chosen to be used for expression of CLP 3.7. 

CLP 3.7 Analysis: 

£. coli strain HBIOI containing plasmid pPT0312 or pPT0314 were grown 
as described in Example 1 . The proteins produced by these cells were analyzed by 
SDS-PAGE for detection of reactivity to CLP antibodies. In every analysis a strong 
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reactive band was observed with an apparent molecular weight of 130 kD and 50 kD 
respectively. 

PPT0312 CLP 3.7 837 AA MW 72,637 

5 MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
[ IGAPGTPGPQGLPGSP) 4] 13 
GAMDPGRYQLSAGRYHYQLVWCQK 

PPT0314 CliP 3.7 417 AA MW 37,060 

10 MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
[ (GAPGTPGPQGLPGSP) J 6 
GAMDPGRYQLSAGRYHYQLVWCQK 

Pp/V5^1-A Pnlvmer rnnstniction 
15 The Protein Polymer Adhesive Substrate (PPAS) polymer was designed to 

include a 17 amino acid oligopeptide block of human fibrin gamma chain within a 
strucniral backbone consisting of 3 complete repeats of a 15 amino acid peptide 
block of human collagen type I (GAPGTPGPQGLPGSP. the CLP3.7 monomer 

repeating amino acid sequence) . ~ 

20 

PPASI-A ^^"g Monomer Synthesis and Construction 

The PPASl-A amino acid monomer sequence with the fibrin ganuna 

sequence shown in bold is as follows: 

(GAPGTPGPQGLPGSP) ^ GAPGTPGEGQQHHLGGAKQAGDVGSP 

25 

One oligonucleotide strand (see Table 7) was synthesized using an Applied 
Biosystems DNA synthesizer model 381A and a 2000A pore resin synihusis column 
supplied by Glen Research. During the synthesis, the required interrupt- pause steps 
for reagent bottle changes were minimized. After the synthesis, the 123 base DNA 
30 fragment was deprotected and cleaved from the column support by treatment in 
ammonium hydroxide at SS'^C for 6 hrs. 
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Table 7 

5 ATGGCAGCGAAAGGGGACCGTGCACCAGGW^CGCCGGGAGAAGGTCAACAGCACCATCTTGG 
TGGAGCGAAACAGGCAGGCGACGTCGGTAGCCCTGGTGCCTTTCCGCTAAAGTCCTGCCGT -3 ' 

5 The PCR reaction was then perfoimed as previously described using the 

same primers as were used in the construction of the CLP3.7 monomer. The 
amplified DNA was then resuspended and digested with ApaU and Dral RENs. The 
digested DNA was then purified using a Probind filter followed by a liio-Spin 
column and then ligated with pPTOBlO previously digested with ApaU and EcoRV 

10 RENs and purified by NACS colunm. The products of the ligation reaction were 
transformed into E. coli strain HBIOI. Plasmid DNA from transformants was 
purified and analyzed by digestion using EcoO109, Hindi and Hindlll RENs. 
Plasmid DNA from the clones showing the correct size insert was purified and 
analyzed by DNA sequencing. Plasmid pPT0318 contained the desired PPASl-A 

15 gene monomer sequence (see Table 8). 
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5' • 
3' - 



BanI Aval/Stnal 

GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA 
CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT 
GAPGTPGPQGLPGSP 



BanI I Gsul S tul Dr al 1 1 

GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG 
CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC 
10 GAPGTPGPQGLPGSP 

Bgll BainHX 
GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA 
CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT 
15 GAPGTPGPQGLPGSP 

GGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGGT 
CCACGTGGTCCTTGCGGCCCTCTTCCAGTTGTCGTGGTAGAACCA 
GAPGTPGEGQQHHLG 

20 

Aatll BanI 
GGAGCGAAACAGGCAGGCGACGTCGGTAGCCCTGGTGCC - 3 ' 
CCTCGCTTTGTCCGTCCGCTGCAGCCATCGGGACCACGG - 5 ' 
GAKQAGDVGSP(GA) 

25 

ron^niction of Expression Plasmid pPT0317 

Plasmid DNA pSY1262 was linearized with PvuII REN, then passed through 
a Probind filter followed by a Bio-Spin column. The DNA was then treated with 
SAP and ligaied with a DNA fragment from pQE-17 (QIAGEN Catalog #33173 ) 
30 prepared as follows. Plasmid DNA pQE-17 was digested with Bglll and. Hindlll 
RENs and the 36 bp fragment (see Table 9) was purified using a Probind filter and 
then a Bio-Spin column. The DNA was purified further using a Microcon-30 filter 
and the filtrate containing the 36 bp was kept. The DNA was then treated with DNA 
Polymerase I and purified through a Probind filter and then a Bio-Spin column. 

35 

Table? 

5 ' - GATCTTCGATCTCATCACCATCACCATCACTA 

3 » - AAGCTAGAGTAGTGGTAGTGGTAGTGATTCGA 

40 The products of the ligation reaction were transformed into £. coli strain 

HBIOI. Plasmid DNA from transformants was purified and analyzed b)' digestion 
using BstYI and Bstll07I RENs. Plasmid DNA from the clones showing the correct 
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restriction pattern was purified and analyzed by DNA sequencing. Plasmid pPT0317 
contained the desired DNA insert and was used for further DNA manipulations. 



PPA^l-A ?n\yrry^r rnnfitniction 

5 Plasmid DNA from pPT0318 was digested with BanI REN and the digestion 

fragments were separated by agarose gel electrophoresis. The PPAS1-/l gene 
fragment, 216 bp, was excised and purified using the Ultrafree-MC filter. The 
purified fragment was ligated with plasmid pPT0317 which had been p:repared as 
follows. Plasmid DNA pPT0317 was digested with BanI REN, then passed through 

10 a Probind filter and then a Bio-Spin column. The DNA was then treated with SAP. 

The products of the ligation reaction were transformed into £. coli strain 
HBIOI. Transformants were selected for resistance to kanamycin. Plasmid DNA 
from individual transformants was purified and analyzed using EcoRI and EcoRV 
RENs for the presence of PPASl-A multimer gene inserts. Several clones were 

15 obtained with insert sizes ranging from 200 bp to approximately 4 kb. Several clones 
containining from 10 to 20 repeats were chosen for use in expression of the PPASl- 
A polymer. 

PPA^^-^ Fypre5;sinn Analvsis 

20 E. coli strain HBlOl containing plasmid pPT0321, pPT0325, pPT0326, or 

pPT0327 was culmred as previously described. The proteins produced by these cells 
showed strong reactive bands of apparent molecular weights ranging from 80 kD to 
180 kD when analyzed by western blot for reactivity to CLP antibody. One clone, 
pPT0321, containing 10 repeats of the PPASl-A monomer was selected for further 

25 study. 

pPT0321 PPASl-A 762 AA MW 68,056 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
[ (QAPGTPGPQGLPGSP) 5 GAPGTPGEGQQHHLGGAKQAGDVGSP] „ 
30 GAMDPGRYQDLRSHHHHHH 
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EXAMPT.F.7 
r^p^rnictinn of SELP8K and SELP8E 

Polymers were prepared designated SELP8K and SELP8E, which are 
characterized by having specific chemically reactive functional groups v/iihin the 
5 elastin-like block. The construction of these polymers is described below starting 
from the previous gene monomer, SELPO (see U.S. Patent #5.243,038, pSY1298, 
where SELPO is termed EBSI). 

SELP8K and SELP8E amino acid monomer sequence design: 
10 SELP8K MONOMER (GhGAGS) , (GVGVP)< GKGVP (GVGVP) 3 

SELF 8 E MONOMER (GAGAGS), (GVGVP) 4 GEGVP (GVGVP) 3 

r<;^n^micTinn of SELP8 Gene Monomer 

Plasmid pSY1378 (see US Patent #5,243,038) was digested with BanI REN, 
15 purified using agarose gel electrophoresis followed by NACS colunui, iuid the DNA 
was then eihanol precipitated in 2.5 M ammonium acetate and ligated v/ith pPT0134 
(See PCT\US92\09485) previously digested with Fokl REN. phenol/chloroform 
extracted and ethanol precipitated. 

The products of the ligation mixture were transformed into E,coli strain 
20 HBIOI. Plasmid DNA from u^nsformants was purified and analyzed by digestion 
using Nrul and Xnml RENs. Plasmid pPT0255 containing the desired restriction 
pattern was obtained and was used for subsequent constructions. 

Plasmid DNA pPT0255 was treated with CfrlOI REN followed by RNAse. 
The digestion fragments were separated by agarose gel electrophoresis, the DNA 
25 was excised and seif-ligated. The products of the ligation mixture were transformed 
into E.coli strain HBlOl, Plasmid DNA from transformants was purifi(*d and 
analyzed by digestion using Nael and StuI RENs. Plasmid pPT0267 containing the 
desired deletion was used for subsequent constructions. 

Two olgonucleotide strands as shown in Table 10 were synthesiized and 
30 purified as described in Example 1 . 
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Table Ifl 

5'- CTGGAGCGGGTGCCTGCATGTACATCCGAGT -3' 

3*- CCGAGACCTCGCCCACGGACGTACATGTAGGCTCA -5' 

5 The two oiigomicleotide strands were annealed and ligated with the DN A of plasraid 

pPT0267 which had been previously digested with Banll and Seal RENs, and 

purified by agarose gel elctrophoresis followed by NACS column. 

The products of this ligation reaction were transformed into Exoli strain 

HBIOI. Plasmid DNA from transformants was purified and digested with Dral. 
10 Plasmid DNA from two clones that gave the correct digestion pattern vras 

sequenced. One plasmid DNA, designated pPT0287. was found to be correct and 

chosen for further constructions. 

Plasmid DNA pSY1298 (see U.S. Patent # 5,243,038) was digijsted with 

Banll REN, and the SELPO gene fragment was purified by agarose gel 
15 electrophoresis followed by NACS and then ligated to pPT0287 digested with Banll. 

The enzyme was then removed using phenol/chloroform extraction and ethanol 

precipitation. 

The products of the ligation mixture were transformed into E.coli strain 
HBIOI. Plasmid DNA from transformants was purified and analyzed by digestion 
20 using Dral REN. Plasmid DNA from the clones showing the correct restriction 
pattern was further digested with Banll. Ahall and StuI RENs. Plasmid pPT0289 
contained the desired SELP8 monomer sequence (see Table 11). 

Tabic 11 

BanI Banll 
25 GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGA GTA GGT GTG CCA GGT 
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCT CAT CCA CAC GGT CCA 
GAGSGAGAGSGVGVPG 



30 GTA GGA GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA GTG 
CAT CCT CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CC:T CAC 
VGVPGVGVPGVGVPGV 

Smal 

35 GGT GTT CCA GGC GTA GGT GTG CCC GGG GTA GGA GTA CCA GGG GVA GGC 
CCA CAA GGT CCG CAT CCA CAC GGG CCC CAT CCT CAT GGT CCC CAT CCG 
GVPGVGVPGVGVPGVG 
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BanIX 

gtc cct gga gcg ggt gct ggt agc ggc gca ggc gcg ggc tct ggfl. gcg 
cag gga cct cgc cca cga cca tcg ccg cgt ccg cgc ccg aga cct cgc 
vpgagagsgagagsga 

5 

rnnmicTinr^ nf SFTPSK and SELP8E Gene Monomers 

One oligonucleotide strand coding for a portion of the SELP8 gene monomer 
was synthesized with a single base polymorphism at position 90. The use of both 
adenine and guanidine at this position produced oligonucleotides from a single 

10 synthesis that encoded the amino acids lysine and g»uumic acid (see Table 12). The 
synthesis was conducted using an Applied Biosystems DNA synthesizer model 381 A 
and a 2000A synthesis column supplied by Glen Research. During the synthesis the 
required interrupt-pauses for bottle changes were minimized. After the synthesis the 
202 base DNA fragment was deprotected and cleaved from the column support by 

15 treatment in 30% ammonium hydroxide at 55**C for 6 hrs. 

Table 12 

5*- atggcagcgaaaggggaccgggctctggtgttggagtgccaggtgtcggtgttccgggtgtagg 
cgttccgggagttggtgtacctgga (A/G) aaggtgttccgggggtaggtgtgccgggcgttggagta 
20 ccaggtgtaggcgtcccgggagcgggtgctggtagcggcgcaggcgcgggctctttccgctaaagtcc 

tgccgt -3* 

Two additional DNA strands were used as primers for PGR amplification. The two 
strands were: 

25 1 . 5'-AAGAAGGAGATATCATATGGCAGCGAAAGGGGACC-3' 
2 . 5 • -CGCAGATCTTTAAATTACGGCAGGACTTTAGCGGAAA-3 ' 

The PGR reaction was carried out and the reaction product was purified as 
described in Example 1 . 
30 The DNA was resuspended and digested with Banll REN as described in 

Example 1 . The digested DNA was then separated by low-melting agarose gel 
electrophoresis and ligated with pPT0289 previously digested with Baidl RENs and 
purified by NAGS column. The products of the ligation reaction wer; transformed 
into E.coli strain HBlOl . Plasmid DNA from isolated transformants was purified 
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and analyzed by digestion using ApaU and EcoNI RENs. Plasmid DMA from the 
clones showing the correct restriction pattern were further analyzed by digestion 
using Asp700 REN to distinguish between clones encoding a lysine or glutamic acid 
at the polymorphic position. Plasmid DNA from clones containing each of the 
5 polymorphs was purified and analyzed by DNA sequencing. Plasmid 13PT0340 
contained the desired SELP8K monomer sequence and pPT0350 contaiined the 
desired SELP8E monomer sequence (see Tables 13 and 14, respectively). 



Tablg 13 

10 

BanI Banll 

GGT GCC GOT TCT GGA GCT GGC GCG GGC TCT GGT GTT GGA GTG CCIA GGT 
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCA CAA CCT CAC GGT CCA 



15 



20 



25 



30 



EcoNI 

GTC GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA AAA 
CAG CCA CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CC:T TTT 
VGVPGVGVPGVGVPGK 

GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA GGC 
CCA CAA GGC CCC CAT CCA CAC GGC CCG CAA CCT CAT GGT CCA CAT CCG 
GVPGVGVPGVGVPGVG 

Smal Banll 
GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT G(3A GCG 
CAG GGC CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA CCT CGC 
VPGAGAGSGAGAGSGA 



Table 14 



35 BanI 

GGT GCC GGT TCT GGA GCT GGC GCG 
CCA CGG CCA AGA CCT CGA CCG CGC 
GAGSGAGA 



GTC GGT GTT CCG GGT GTA GGC GTT 
CAG CCA CAA GGC CCA CAT CCG CAA 
VGVPGVGV 



Banll 

GGC TCT GGT GTT GGA GTG CCA GGT 
CCG AGA CCA CAA CCT CAC GST CCA 
GSGVGVPG 

ECONI 

CCG GGA GTT GGT GTA CCT GGA GAA 
GGC CCT CAA CCA CAT GGA CCT CTT 
PGVGV.PGE 



Asp700 

GGT GTT CCG GGG GTA GGT GTG CCG 
CCA CAA GGC CCC CAT CCA CAC GGC 
GVPGVGVP 



GGC GTT GGA GTA CCA GGT GTA GGC 
CCG CAA CCT CAT GGT CCA CAT CCG 
GVGVPGV G 
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Smal 

GTC CCG GGA GCG GGT GCT GGT AGC 
CAG GGC CCT CGC CCA CGA CCA TCG 
VPGAGAGS 



Banll 

GGC GCA GGC GCG GGC TCT GGA GCG 
CCG CGT CCG CGC CCG AGA CCT CGC 
GAGAGSGA 



SELP8K Polymgr Cgnstniction 

Plasmid DNA from pPT0340 was digested with BanI REN and iJie digestion 
fragments were separated by agarose gel electrophoresis. The SELP8K gene 

10 fragment, 192 bp. was excised and purified by NACS colunm. The purified 
fragment was ligated with plasmid pPT0317 which had been digested with BanI 
REN, passed through a Millipore Probind and a Bio-Spin 6 column. Tlie DNA was 
then treated with shrimp alkaline phosphatase (SAP) as described in Example 1 , 
The products of this ligation reaction were transformed into E,coli strain 

15 HBIOI. Transformants were selected for resistance to kanamycin. Plasmid DNA 
from individual transformants was purified and analyzed for increased si^e due to 
SELP8K monomer multiple DNA insertion. Several clones were obtained with insert 
sizes ranging from 200 bp to approximately 7 kb. Clones coniainining from 6 to 32 
repeats, were used for expression of the SELP8K protein polymer (pPTCi341, 

20 pPT0343, pPT0344, pPT0345 and pPT0347). 



SELP8K Expression Analvsis 

E. coli strain HBlOl containing plasmids pPT0341, pPT0343, pPT0344, 
pPT0345 and pPT0347 were grown as described in Example 1 . The proteins 

25 produced by these cells were analysed by Western blot for detection of proteins 
reactive to SLP antibodies. Each clone produced a strongly reactive band. The 
apparent molecular weights of the products ranged from approximately 35kD to 
greater than 250 kD. Strain pPT0345 produced an SLP antibody reactive band of 
apparent molecular weight 80,000. The expected amino acid sequence cf the 

30 SELP8K polymer encoded by plasmid pPT0345 is shown below. 
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PPT0345 SELPBK 884 AA MW 69,772 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS 
( (GVGVP) , GKGVP (GVGVP)3 (GAGAGS) , ] „ 
5 (GVGVP), GKGVP (GVGVP), (GAGAGS) 7 

GAGAMDPGRYQDLRSHHHHHH 

Thr '"l>nftT"'^^'"" SEIPOK Polymers 

The copolymer structure of SELP8K consists of silk-like blocks (SLP block) 

10 and elastin-like blocks (ELP block) in the following sequence: [(SLP b]ock)4 (ELP 
block)gl. Additional polymers were designed to have different resorption and 
solution properties by adjusting their silk-like to elastin-like block lengths while 
maintaining their reactive properties. SELPOK contains half the length of 
crystallizable silk-like blocks than SELP8K while maintaining the dispersion 

15 frequency with respect to the elastin-like blocks. 

A polymer with intervening sequences to promote in vivo resor|>iion through 
proteolytic cleavage by coUagenase (92 kd) was also designed. SELPOK-CSl 
contains two adjacent cleavage sites for collagenase (PLGP) within a six amino acid 
insert (GAGAGS GVGVP L G P L G P GVGVP). 

20 

rp n^tnictinn of Pla smid pPT0317 

Plasmid DNA pSY1262 (see U.S. Patent No. 5,243.038) was linearized with 
PvuII REN, then passed through a Probind filter and a Bio-Spin 6 column. The 
DNA was then treated with Shrimp Alkaline Phosphatase (SAP). The linearized 

25 pSY1262 DNA was then ligated with a DNA fragment from pQE-17 ((JIAGEN 
Catalog #33173) prepared as follows. Plasmid DNA pQE-17 was digested with 
Bglll and Hindlll RENs and the 36 bp fragment shown in Table 15 was purified 
using a Probind filter and a Biospin column. The DNA was purified farther using a 
Microcon-30 filter and the filtrate, containing the 36 bp fragment, was kept. The 

30 DNA was then treated with DNA Polymerase I and purified using a Pi'obind filter 
and a Biospin column (see Example 1). 
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Tabig 15 

5'-GATCTTCGATCTCATCACCATCACCATCACTA 

3 '-AAGCTAGAGTAGTGGTAGTGGTAGTGATTCGT 

5 The product of the ligation reaction was transformed into £. colt strain 

HBIOI. Plasmid DNA from transformants was purified and analyzed by digestion 
using Bstll07I and EcoRV RENs. The clones containing the desired DNA fragment 
were further digested with Bstll07I and BstYI RENs to determine the orientation of 
the insert. Plasmid DNA from the clones showing the correct restriction pattern was 

10 purified and analyzed by DNA sequencing. Plasmid pPT0317 contained the desired 
DNA insert and was used for further DNA constructions. 

SFl .POK Pnlvmer Constniction 

One oligonucleotide strand as shown in Table 16 was synthesized using an 
15 Applied Biosystems DNA synthesizer model 381A and a 2000A synthesi:* column 
supplied by Glen Research. After the synthesis the 93 base DNA fragmcrni was 
deprotected and cleaved from the column support by treatment in ammonium 
hydroxide at 55**C for 6 hours. 

20 lablfiLlfi 

5 ' - ATGGCAGCGAAAGGGGACCGGTGCCGGCGCAGGTAGCGGAGCCGGTGCGGGCTCAA AAAGGG 
CTCTGGTGCCTTTCCGCTAAAGTCCTGCCGT -3' 

The PGR reaction was performed using the same two DNA primer strands as 
25 described for the construction of the SELP8K gene monomer and the reaction 
product was purified. The DNA was resuspended and digested with BanI REN, 
The digested DNA was then separated by low-melting agarose gel and li gated with 
pPT0285 (see PCT/US92/09485) previously digested with BanI REN ard purified 
by NACS column. The product of the ligation reaction was transforme<l into £. coli 
30 strain HBlOl . Plasmid DNA from transformants was purified and analyzed by 

digestion using EcoRI and Banll RENs. Plasmid DNA from the clones showing the 
correct restriction pattern was then purified and analyzed by DNA sequencing. 
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Plasmid pPT0358 contained the desired sequence and was used for subsequent DNA 
constructions. 

Plasmid DNA from pPT0340 was digested with Banll REN and the digestion 
fragments were separated by agarose gel electrophoresis. The SELPOK gene 
5 fragment, 156 bp, (see Table 17), was excised and purified using an Ultrafree-MC 
filter followed by Bio-Spin 6 column. 

Tabic 17 

Banll 

G GGC TCT GGT GTT GGA GTG CCA GOT GTC GGT GTT CCG GGT GTA (JGC GTT 
10 C CCG AGA CCA CAA CCT CAC GGT CCA CAG CCA CAA GGC CCA CAT CCG CAA 
GSGVGVPGVGVPGVC;v 

ccg gga gtt ggt gta cct gga aaa ggt gtt ccg ggg gta ggt gt(; ccg 
ggc cct caa cca cat gga cct ttt cca caa ggc ccc cat cca cac ggc 
ISpgvgvpgkgvpgvgvp 

ggc gtt gga gta cca ggt gta ggc gtc ccg gga gcg ggt gct ggr agc 
ccg caa cct cat ggt cca cat ccg cag ggc cct cgc cca cga cca tcg 
gvgvpgvgvpgagags 

20 

Banll 

GGC GCA GGC GCG GGC TC 
CCG CGT CCG CGC CCG AG 
G A G A G S 

25 

The purified fragment was ligated with plasmid pPT0358 which had been 
digested with Banll REN, then passed through a Probind filter and a Microcon-30 
filter. The digestion fragments were then separated by agarose gel electrophoresis. 
The plasmid DNA was then excised and purified using an Ultrafree-MC! filter 

30 followed by Bio-Spin 6 column (see Example 1). 

The product of this ligation reaction was transformed into £. coU strain 
HBIOI. Transformants were selected for resistance lo chloramphenicol. Plasmid 
DNA from individual transformants was purified and analyzed for incnased size due 
to SELPOK multiple DNA insertion. Several clones were obtained witli inserts of 

35 different sizes. Plasmid pPT0359, pPT0360 and pPT0374 containing p;speciively 
18, 2 and 6 repeats of the SELPOK gene monomer were used for subse(}uent 
constructions. 

Plasmid DNA from pPT0359 and pPT0374 was digested with BanI REN and 
the digestion fragments were separated by agarose gel electrophoresis. The SELPOK 
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gene fragments, approximately 2800 bp and 1000 bp, were excised and purified by 
NACS column. The purified fragments were then ligated with plasmid plT'OSI? 
which had been digested with BanI REN, then passed dirough a Probind filter and a 
Bio-Spin 6 colunm. The DNA was then treated with Shrimp Alkaline Phosphatase 
5 (SAP), passed through a Probind filter and then a Bio-Spin 6 column (see Example 
1). 

The product of these ligation reactions was transformed into £. coli strain 
HBIOI. Transformants were selected for resistance to kanamycin. Plasmid DNA 
from individual transformants was purified and analyzed for increased swi due to 
10 SELPOK multiple DNA insertion. Several clones were obtained. Plasmid pPT0364 
and pPT0375 were chosen to be used for expression of SELPOK. 

SELPOK Expression Analvsis 

E. coli strain HBlOl containing plasmid pPT0364 and pPT0375 were grown 
15 as described in Example 1. The proteins produced by these cells were analysed by 
SDS-PAGE for detection of reactivity to ELP antibodies. In every analysis a strong 
reactive band was observed of an apparent molecular weight of approximsitely 95 kD 
and 35 kD respectively. 

20 pPT0364 SELPOK 1000 AA MW 80,684 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
[ (GAGAGS) 2 (GVGVP) , GKGVP (GVGVP) 3] 
(GAGAGS), GAGAMDPGRYQDLRSHHHHHH 

25 PPT0375 SELPOK 376 AA MW 31,445 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
[(GAGAGS) 2 (GVGVP) < GKGVP (GVGVP) 3] e 
( GAGAGS ) 2 GAGAMDPGRYQDLRSHHHHHH 

30 SELPQK-CSl Pfilymgr Construction 

Plasmid pPT0360 was digested with BanI REN and the digestion iragments 
were separated by agarose gel electrophoresis. The SELPOK gene fragment, 
approximately 300 bp, was excised and purified using an Ultrafree-MC filter 
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followed by Bio-Spin 6 column. The purified fragment was ligated wiith piasmid 
pPT0134 (see PCT/US92/09485) which had been digested with Fokl IlEN. The 
enzyme was heat inactivated at 65''C for 20 minutes and the ligation mixture was 
then passed through a Probind filter. The DNA was then treated with Shrimp 
S Alkaline Phosphatase (SAP), passed through a Probind filter and then a Bio-Spin 6 
column. 

The product of this ligation reaction was transformed into E, coli strain 
HBIOI. Transformants were selected for resistance to chloramphenicol. Piasmid 
DNA from individual transformants was purified and analyzed by digestion using 

10 Dral REN. One piasmid, pPT0363, showed the correct restriction pattern and was 
used for subsequent DNA constructions. 

One oligonucleotide strand as shown in Table 18 was synthesi2ed using an 
Applied Biosysiems DNA synthesizer model 381 A and a 2000A synthesis column 
supplied by Glen Research. After the synthesis the 141 base DNA fragment was 

15 deprotected and cleaved from the column support by treatment in anurionium 
hydroxide at 55**C for 6 hours. 

Table 18 

5 ' - ATGGCAGCGAAAGGGGACCGCCGGTGCGGGCTCTGGTGTTGGAGTGCCGCTGGGTCCTCTTGG 
20 CCCAGGTGTCGGTGTTCCGGGTGTAGGCGTTCCGGGAGTTGGTGTACCTGGAAAAGGITTCCGCTAA 
AGTCCTGCCGT -3* 



The PCR reaction was performed using the same two DNA primer strands as 
described for the construction of the SELP8K gene monomer and the reaction 

25 product was purified. The DNA was then resuspended and digested with BsrFI and 
EcoNI RENs. The digested DNA was treated with Probind and Micrccon-30 filters, 
a Bio-Spin 6 column, and then ligated with pPT0363 previously digested with BsrFI 
REN, treated with a ProBind filter and a Bio-Spin 6 column and then lurther 
digested with EcoNI REN. The digestion fragments were separated b)* agarose gel 

30 electrophoresis. The larger DNA band, approximately 2000 bp, was excised and 
purified using an Ultrafree-MC filter followed by Bio-Spin 6 column (see Example 
1). 
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The product of the ligation reaction was transformed into £. coli strain 
HBIOI. Plasmid DNA from individual transformants was purified and zinalyzed by 
digestion using Asp700I and EcoO109I RENs. Plasmid DNA from the clones 
showing the correct restriction pattern was then purified and analyzed by DNA 
5 sequencing. Plasmid pPT0368 (see Table 19) contained the desired sequence and 
was used for subsequent DNA constructions. 



Table 19 

Banll 

G GGC TCT GGT GTT GGA GTG CCG CTG GGT CCT CTT GGC CCA GGT GTC 
C CCG AGA CCA CAA CCT CAC GGC GAC CCA GGA GAA CCG GGT CCA a^G 
GSGVGVPLGPLGPGV 



GGT GTT CCG GGT GTA 
CCA CAA GGC CCA CAT 
G V P G V 



GGC GTT CCG GGA GTT 
CCG CAA GGC CCT CAA 
G V P G V 



GGT GTA CCT GGA AAA 
CCA CAT GGA CCT TTT 
G V P G K 



GGT GTT CCG GGG GTA GGT GTG 
CCA CAA GGC CCC CAT CCA CAC 
G V P G V G V 



CCG GGC GTT GGA GTA CCA GGT GTA 
GGC CCG CAA CCT CAT GGT CCA CAT 
PGVGVPGV 



GGC GTC CCG GGA GCG 
CCG CAG GGC CCT CGC 
G V P G A 



GGT GCT GGT AGC GGC 
CCA CGA CCA TCG CCG 
G A G 5 G 



Banll 
GCA GGC GCG GGC TCT 
CGT CCG CGC CCG AGA 
A G A G S 



Plasmid DNA pPT0368 was digested with Banll REN, and ttie digestion 
fragments were separated by agarose gel electrophoresis. The SELPOK-CSl gene 
fragment, 174 bp, was excised and purified using an Ultrafree-MC filter followed by 
Bio-Spin 6 column. The purified fragment was ligated with plasmid pFr0358 which 
had been digested with Banll REN, then passed through a Probind filter and a 
Microcon-30 filter. Subsequently the digestion fragments were separated by agarose 
gel electrophoresis. The plasmid DNA was then excised and purified using an 
Ultrafree-MC filter followed by Bio-Spin 6 column (see Example 1). 

The product of this ligation reaction was transformed into £. coli strain HBlOl . 
Transformants were selected for resistance to chloramphenicol. Plasmid DNA from 
individual u-ansformants was purified and analyzed for increased size due to SELPOK- 
CSl multiple DNA insertion. Several clones were obtained with insert !iizes ranging 
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from 1000 bp to approximately 3000 bp. Plasmid pPT0369 containing 16 repeats of 
the SELPOK-CSl gene monomer was used for subsequent constructions;. 

Plasmid DNA from pPT0369 was digested with BanI REN, Ifollowed by a 
Probind filter and then the digestion fragments were separated by agarose gel 

5 electrophoresis. The SELPOK-CSl gene fragment, approximately 2800 bp. was excised 
and purified by an Ultrafree-MC filter and desalted using a Bio-Spin 6 column. The 
purified fragments were then ligated with plasmid pPT0317 which had been digested 
with BanI REN and then passed through a Probind filter and a Bio-Spin 6 column. The 
DNA was then treated with Shrimp Alkaline Phosphatase (SAP), passed through a 

0 Probind filter and then a Bio-Spin 6 column (see Example I), 

The product of these ligation reactions was transformed into E, coli strain 
HBlOl . Transformants were selected for resistance to kanamycin. Plasmid DNA from 
individual transformants was purified and analyzed for increased size due to SELPOK- 
CSl multiple DNA insertion. Several clones were obtained. Plasmid pPT0370 was 

5 chosen to be used for expression of SELPOK-CSl . 

5;FT.P0K-r!.S1 Expression Analysis 

E. coli strain HBlOl containing plasmid pPT0370 was grown as described in 
Example 1. The proteins produced by these cells were analysed by SDS-PAGE for 
Q detection of reactivity to ELP antibodies. In every analysis a strong reactive band was 
observed with an apparent molecular weight of approximately 90 kD. 

pPT0370 SELPOK-CSl 934 AA MW 76,389 

MDPWLQRRDWENPGVTQLNRLAAHPPFASDPM 
:5 [(GAGAGS)2 (GVGVP)i LGPLGP (GVGVPlj GKGVP (GVGVP) 3] 

{ GAGAGS ) 2 GAGAMDPGRYQDLRSHHHHHH 

As is evident from the above results, highly repetitive sequences can be 
prepared, cloned, and used for expression to produce a wide variety of products which 
iO may mimic natural products, such as silk and other proteins and antigens . In addition, 
novel systems are provided for controlling the expression of the peptide under inducible 
conditions in a variety of hosts. In this manner, new proteinaceous products can be 
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provided which provide for new properties or may closely mimic the propenies of 
naturally occurring products. 
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All publications and patent applications mentioned in this specification are 
indicative of the level of skill of those skilled in the an to which this invention pertains. 
All publications and patent applications are herein incorporated by reference: to the same 
extent as if each individual publication or patent application was specifically and 
5 individually indicated to be incorporated by reference. 

The invention now being fiilly described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the appended claims. 
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W^ATJf^ PT ATMFDTS: 

1. A method of preparing a synthetic DNA sequence having repeating units of 
from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said 
5 method comprising: 

(1) synthesizing at least two different pairs of single stranded oligomers 
wherein each of the oligomers of a pair overlap except as to any protruding ends; 

(2) hybridizing each pair of single stranded oligomers to provide double 
10 stranded segments; 

(3) combining said segments or amplified copies thereof in a cloning vector 
to form a monomer, where the combined segments are in reading frame; 

15 (4) excising said monomer from said cloning vector by restriction enzyme 

digestion; and 

(5) oligomerizing said monomer to provide a multimer comprising at least 
two monomers. 

20 

2. A method according to Claim 1, wherein at least one said restriction 
enzyme cuts at an asynuneuic consensus sequence or at a site distal from the consensus 
sequence. 

25 3. A method according to Claim 1, wherein said monomer has protruding 

termini which are com.plementary to each other. 

4. A method according to Claim 1, wherein said pairs of oligomers includes 
at least two different pairs encoding the same amino acid sequence. 

30 

5. A method according to Claim 1 , wherein said pairs of oligomers includes 
at least two different pairs encoding different amino acid sequences. 
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6. A method according to Claim 1, wherein the number of pairs of 
oligomers is in the range of 2 to 4 and at least one oligomer has protruding ends on the 
same strand. 

5 7. A method according to Claim 1 , wherein said repeating units have from 

4 to 12 codons. 

8. A method according to Claim 1, wherein at least a poition of said 
monomer is sequenced prior to oligomerizing to provide said multimer. 

10 

9. A method of preparing a synthetic DNA sequence having repeating units 
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said 
method comprising: 

(1) synthesizing at least three different pairs of single stranded oligomers 
15 wherein each of the oligomers of a pair overlap except as to any protruding ends; 

(2) hybridizing each pair of single stranded oligomers to provide double 
stranded segments each having from 21 to 90 bases to provide at least three segments, 
each segment having a different nucleic acid sequence and having complementary ends 

20 to contiguous segments; 

(3) combining said segments or amplified copies thereof in a cloning vector 
to form a monomer, where the combined segments are in reading frame; 

25 (4) excising said monomer from said cloning vector by restriction enzyme 

digestion; and 

(5) oligomerizing said monomer to provide a multimer comprising at least 
two monomers. 

30 

10. A method according to Claim 9, including the additional step of 
analyzing said monomer prior to said oligomerizing. 

100 



wo 98/10063 



PCT/US96/15306 



11. A method according to Claim 9, wherein at least one repeating unit 
encodes an amino acid sequence consisting of: 

5 Gap, GAGAGS. GVGVP, VPGVG, SGAGAG, and AGAGSC5, 

wherein a and P are any amino acid, a and P being selected so that the encoded protein 
comprises from about 10 to 45 number % proline. 

10 12. A method of preparing a DNA sequence having repeating units of from 

about 3 to IS codons and encoding a protein of at least about 30kDaI, said method 
comprising: 

(1) synthesizing at least three different pairs of single stranded oligomers of 
15 from about 30 to 100 bases, wherein each of the oligomers of a pair overlap except as 

to any protruding ends; 

(2) hybridizing said pairs of oligomers to provide segments; 
20 (3) inserting a first segment into a linearized cloning vector; 

(4) sequencing said first segment to ensure the fidelity of said sequence; 

(5) sequentially linearizing said vector with a restriction enzyme cleaving 
25 proximal to a terminus of said segment, and adding additional segments at a terminus 

of the prior segment, by digesting said cloning vector with a restriction c:nzyme cutting 
at a site proximal to a terminus of the preceding segment, insening each successive 
segment in reading frame with the prior segment, and cloning the cloning vector 
comprising said next segment, to provide a monomer; 

30 

(6) excising said monomer from said cloning vector; 
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(7) oligomerizing said monomer to provide at least one multimer comprising 
at least two monomers; 

wherein the sequences of said segments and vector are selected to permit 
insertion of said segments and excision of said monomer by restriction enzyme digestion 
5 with restriction enzymes which cleave at asymmetric consensus sequences or distal from 
the consensus sequence. 

13. A method according to Claim 12, including the additicnal step of 
analyzing said monomer prior to said oligomerizing. 

10 

14. A method according to Claim 12, wherein at least one rqpeating unit 
encodes an amino acid sequence consisting of: 

Gap. GAGAGS, GVGVP, VPGVG, SGAGAG. and AGAGSG. 

15 

wherein a and P are any amino acid, a and P being selected so that the encoded protein 
comprises from about 10 to 45 number % proline. 

15. A method of preparing a synthetic DNA sequence having repeating units 
20 of from about 3 to 15 codons and encoding a protein of at least about 3Ci kDal, said 

method comprising: 

(1) synthesizing at least two different pairs of single stranded oligomers 
wherein each of the oligomers of a pair overlap except as to any protruding ends; 

25 

(2) hybridizing each pair of single stranded oligomers to provide double 
stranded segments; 

(3) cloning a first segment in a cloning vector and analyzing the cloned first 
30 segment to determine the fidelity of the sequence, discarding any segment having an 

erroneous sequence; 
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(4) either: 

(a) adding each successive segment in reading frame to prior 
segments to provide a monomer and determining the fidelity of the sequence of each 

5 successive segment; or 

(b) cloning each successive segment in a cloning vector and 
analyzing each successive segment to determine the fidelity of the sequence and 
combining said segments or amplified copies thereof in a cloning vector to form a 

10 monomer, where the combined segments are in reading frame; 

(5) excising said monomer from said cloning vector by restriction enzyme 
digestion; and 

15 (6) oligomerizing said monomer to provide a multimer comprising at least 

two monomers. 

16. A method according to Claim 15. including the additional step of 
analyzing said monomer prior to said oligomerizing. 

20 

17. A method according to Claim 15, wherein at least one repeating unit 
encodes an amino acid sequence consisting of: 

Gap, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGS(3. 

25 

wherein a and P are any amino acid, a and P being selected so that the encoded protein 
comprises from about 10 to 45 number % proline. 

18. A method of preparing a syndietic DNA sequence having repeating units 
30 of from about 3 to 15 codons and encoding a protein of at least aboui: 30 kDal, said 

method comprising: 
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(1) synthesizing at least two different pairs of single strandtjd oligomers 
wherein each of the oligomers of a pair overlap except as to any protruding ends; 

(2) hybridizing each pair of single stranded oligomers to provide double 
5 stranded segments; 

(3) isolating at least one double stranded segment from a previously 
synthesized monomer; 

10 (4) combining said segments or amplified copies thereof in a cloning vector 

to form a monomer, where the combined segments are in reading frame; 

(5) excising said monomer from said cloning vector by restriction enzyme 
digestion; and 

15 

(6) oligomerizing said monomer to provide a multimer comprising at least 
two monomers; 

wherein synthesized segments are sequenced to ensure the fidelity of replication. 

20 

19. A method according to Claim 18, including the additional step of 
analyzing said monomer prior to said oligomerizing. 

20. A method according to Claim 18, wherein at least one repealing unit 
25 encodes an amino acid sequence consisting of: 

Gap, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG, 

wherein a and P are any amino acid, a and P being selected so that the encoded protein 
30 comprises from about 10 to 45 number % proline. 
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21 . A method of preparing a synthetic DNA sequence having repeating units 
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said 
method comprising: 

5 (1) synthesizing a single strand encoding a monomer comprising from about 

100 to 300 bases comprising repeating units of from about 3 to 15 codons flanked by 
primer binding sites; 

(2) preparing a complementary strand and amplifying the resultant dsDNA 
10 monomer by polymerase chain reaction using primers of from 15 to 50 nt having a 

sequence comprising a sequence which does not hybridize with the repetitive portion 
of the sequence of said single strand and further comprising none or a ]X)rtion of the 
consensus sequence of a restriction site, the single strand completing i:he consensus 
sequence where the primer comprises a portion of the restriction site, wherein said 
15 primers are characterized by being different for each end of the dsDNA, hiaving at least 
40%GC, a T„ of at least 85*C when hybridizing with the complementa]7 primer and 
the differing between the two primers by not more than 2*'C; 

(3) cloning said dsDNA monomer and selecting for a monomer having the 
20 correct sequence; and 

(4) oligomerizing said monomer to provide at least one multimer comprising 
at least two monomers. 

25 22. A method according to Claim 21, wherein at least one repeating unit 

encodes an amino acid sequence consisting of: 

Gap, GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG, 

30 wherein a and p are any amino acid, a and P being selected so that the encoded 

protein comprises from about 10 to 45 number % proline. 
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23. A method according to Claim 21, comprising the fuither steps of 
removing said primers with a restriction enzyme resulting in protruding ends and 
excising said dsDNA monomer after cloning with a restriction enzyme resulting in 
protruding ends. 

5 

24. A method of preparing a synthetic DNA sequence having repeating units 
of from about 3 to IS codons and encoding a protein of at least about 30 kDaU said 
method comprising: 

10 (1) synthesizing at least two different pairs of single stranded oligomers of 

wherein each of the oligomers of a pair overlap except as to any protruding ends; 

(2) hybridizing each pair of single stranded oligomers to pirovide double 
stranded segments; 

15 

(3) combining said segments or amplified copies thereof in a c:loning vector 
to form a monomer, where the combined segments are in reading frame:; 

(4) excising said monomer from said cloning vector by restriction enzyme 
20 digestion; 

(5) oligomerizing said monomer to provide a multimer compirising at least 
two monomers; 

25 wherein the sequences of said segments and vector are selected to permit 

insenion of said segments and excision of said monomer by restriction enzyme 
digestion; 

(6) inserting said multimer in an expression vector functional For expression 
30 in an expression host: 



(7) introducing said expression vector into said expression host; and 
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growing said expression host, whereby said protein polymer is expressed. 

25. A method according to Claim 24, including the additional step of 
purifying said expressed protein polymer. 

5 

26. A method according to 24, wherein at least one repeating unit encodes 
an amino acid sequence consisting of: 

Gap, GAGAGS, GVGVP, VPGVG, SGAGAG. and AGAGS(3, 

10 

wherein a and p are any amino acid, a and P being selected so that the encoded protein 
comprises from about 10 to 45 number % proline. 

27. A method according to Claim 24, wherein said expression host is £. 

15 coli. 

28. A method of preparing a synthetic DNA sequence having repeating units 
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal. said 
method comprising: 

20 

(1) synthesizing a single strand encoding a monomer comprisiing from about 
100 to 300 bases comprising repeating units of from about 3 to 15 codons flanked by 
primer binding sites; 

25 (2) preparing a complementary strand and amplifying the resultant dsDNA 

monomer by polymerase chain reaction using primers of from 20 to 45 nt having a 
sequence wherein said primer does not hybridize with the repetitive portion of the 
sequence of said single strand and comprising none or a portion of a restriction site 
proximal to the terminus of said primer, wherein said primers are ctiaracterized by 

30 being different for each end of the dsDNA, having at least 40%GC, a of at least 
85 °C with the between the two primers differing by not more than 2*C and; 
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(3) cloning said dsDNA monomer and selecting for a monomer having the 
correct sequence; 

(4) oligomerizing said monomer to provide at least one multimei: comprising 
5 at least two monomers; 

(5) inserting said multimer in an expression vector functional for expression 
in an expression host; 

10 (6) introducing said expression vector into said expression host; and 

growing said expression host, whereby said protein polymer is expressed. 

29. A method according to Claim 28 » including the additional step of 
purifying said expressed protein polymer. 

15 

30. A method according to 28, wherein at least one repeating unit encodes 
an amino acid sequence consisting of: 

Gap. GAGAGS, GVGVP, VPGVG, SGAGAG, and AGAGSG, 

20 

wherein a and p are any amino acid, a and P being selected so that the encoded protein 
comprises from about 10 to 45 number % proline. 

31. A method according to Claim 28, wherein said expression host is E. 

25 colL 

32. A method of preparing a synthetic DNA sequence having rep eating units 
of from about 3 to 15 codons and encoding a protein of at least about 30 kDal, said 
method comprising: 

30 (1) synthesizing a single strand encoding a first segment of a monomer, 

wherein said monomer comprises from about 100 to 300 bases comprising repeating 
units of from about 3 to 15 codons; 
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(2) preparing a complementary strand and amplifying the resultant first 
dsDNA segment by polymerase chain reaction using primers of from 15 tc 50 nt having 
a sequence comprising a sequence which does not hybridize with the rep<!titive portion 
of the sequence of said single strand and further comprising none or a portion of the 

5 consensus sequence of a restriction site, the single strand completing iLhe consensus 
sequence where the primer comprises a portion of the restriction site, wherein said 
primers are characterized by being different for each end of the dsDNA. liaving at least 
40%GC, a T„, of at least 85'C when hybridizing with the complementary primer and 
the differing between the two primers by not more than 2**C; 

10 

(3) cloning said first dsDNA segment into a first cloning vector and 
selecting for a second double stranded DNA segment having the correct: sequence; 

(4) excising said second double stranded segment from said first cloning 
15 vector by restriction enzyme digestion; 

(5) isolating a third double stranded segment from a previously synthesized 
monomer; 

20 (6) combining said second and third double stranded segmenis or amplified 

copies thereof in a second cloning vector to form a monomer, wherein the combined 
segments are in reading frame and wherein said monomer comprises fiom about 100 
to 300 bases comprising repeating units of from about 3 to 15 codons; 

25 (7) excising said monomer from said second cloning vector by restriction 

enzyme digestion; and 

(8) oligomerizing said monomer to provide a multimer comprising at least 
two monomers. 

30 

33. The method according to Claim 32 wherein said previoujjly synthesized 
monomer is prepared according to the method comprising: 
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(1) synthesizing at least two different pairs of single stranded oligomers 
wherein each of the oligomers of a pair overlap except as to any protruding ends; 

(2) hybridizing each pair of single stranded oligomers to provide double 
5 stranded segments; 

(3) combining said segments or amplified copies thereof in a cloning vector 
to form a monomer, where the combined segments are in reading frame; 

10 (4) excising said monomer from said cloning vector by restriction enzyme 

digestion; and 



(5) oligomerizing said monomer to provide a multimer comprising at least 
two monomers. 

15 

34. The method according to Claim 32 wherein said previosuly synthesized 
monomer is prepared according to the method comprising: 

(1) synthesizing a single strand encoding a monomer comprising from about 
20 100 to 300 bases comprising repeating units of from about 3 to 15 codons; flanked by 
primer binding sites; 



(2) preparing a complementary strand and amplifying the resuliant dsDNA 
monomer by polymerase chain reaction using primers of from 15 to 50 nt having a 

25 sequence comprising a sequence which does not hybridize with the repetitive portion 
of the sequence of said single strand and further comprising none or a portion of the 
consensus sequence of a restriction site, the single strand completing the consensus 
sequence where the primer comprises a portion of the restriction site, wherein said 
primers arc characterized by being different for each end of the dsDNA, having at least 

30 40%GC, a T„ of at least 85**C when hybridizing with the complementary primer and 
the T„ differing between the two primers by not more than 2**C; 
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(3) cloning said dsDNA monomer and selecting for a monomer having the 
correct sequence; and 

(4) oligomeriztng said monomer to provide at least one multinier comprising 
5 at least two monomers. 
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